
added some analysis on cache effectiveness.

JINMEI Tatuya 12 years ago
commit fb3a565eed
1 changed file with 87 additions and 0 deletions
      doc/design/resolver/03-cache-algorithm.txt


@@ -20,3 +20,90 @@ which algorithm is best for that. If it is very critical, then a
 custom algorithm designed for DNS caching makes sense. If it is not,
 then we can consider using an STL-based data structure.
 
+Effectiveness of Cache
+----------------------
+
+First, I'll try to answer the introductory questions.
+
+In a simplified model, we can express the fraction of the total
+running time (including time spent on recursive resolution due to
+cache misses) that is used for answering queries directly from the
+cache as follows:
+
+A = r*Q2 / (r*Q2 + Q1*(1-r))
+where
+A: fraction of each unit of time (such as a second) spent answering
+   queries directly from the cache (0<=A<=1)
+r: cache hit rate (0<=r<=1)
+Q1: max qps of the server with a 100% cache hit rate
+Q2: max qps of the server with a 0% cache hit rate
+
+Q1 can be measured easily for a given data set; measuring Q2 is
+tricky in general (it requires many external queries with unreliable
+results), but we can still get reasonably realistic numbers through
+controlled simulation.
+
+As a data point for these values, see some previous experimental
+results of mine:
+https://lists.isc.org/pipermail/bind10-dev/2012-July/003628.html
+
+Looking at the "ideal" server implementation (no protocol overhead)
+with setups of 90% and 85% cache hit rates, one recursion per cache
+miss, and the maximum possible total throughput, we can deduce Q1 and
+Q2: 170591 qps and 60138 qps respectively.
+
+This means that with a 90% cache hit rate (r = 0.9), the server would
+spend 76% of its run time receiving queries and answering them
+directly from the cache: 0.9*60138/(0.9*60138 + 0.1*170591) = 0.76.
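+
+As a quick cross-check of that arithmetic, here is a small Python
+sketch of the model above (the script and its function name are
+purely illustrative, not part of any implementation):
+
+  # Fraction of run time spent answering directly from the cache,
+  # i.e. A = r*Q2 / (r*Q2 + Q1*(1-r)) as defined above.
+  def cache_time_fraction(r, q1, q2):
+      return r * q2 / (r * q2 + q1 * (1 - r))
+
+  # Q1 and Q2 deduced from the "ideal" server experiment:
+  print(cache_time_fraction(0.9, 170591, 60138))  # ~0.76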
+
+I also ran more realistic experiments: I used BIND 9.9.2 and unbound
+1.4.19 in "forward only" mode with crafted query data and a forwarded
+server set up to emulate the situations of 100% and 0% cache hit
+rates.  I then measured the max response throughput using a
+queryperf-like tool.  In both cases Q2 is about 28% of Q1 (I'm not
+showing specific numbers to avoid unnecessary discussion about the
+performance of specific existing servers; that's out of scope for
+this memo).  Using Q2 = 0.28*Q1, the above equation with a 90% cache
+hit rate gives: A = 0.9*0.28 / (0.9*0.28 + 0.1) = 0.716.  So the
+server would spend about 72% of its running time answering queries
+directly from the cache.
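+
+Since A depends only on the ratio Q2/Q1, the same check can be done
+when only that ratio is known.  Another small, purely illustrative
+sketch (dividing the numerator and denominator of the formula by Q1):
+
+  # A in terms of the ratio k = Q2/Q1:
+  #   A = r*k / (r*k + (1-r))
+  def cache_time_fraction_from_ratio(r, k):
+      return r * k / (r * k + (1 - r))
+
+  print(cache_time_fraction_from_ratio(0.9, 0.28))  # ~0.716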
+
+Of course, these experimental results are too simplified.  First, in
+these experiments we assumed only one external query is needed on a
+cache miss.  In general it can be more; however, the assumption may
+not actually be too optimistic either: in another research result of
+mine:
+http://bind10.isc.org/wiki/ResolverPerformanceResearch
+a more detailed analysis, using a real query sample and tracing what
+an actual resolver would do, suggested we'd need about 1.44 to 1.63
+external queries per cache miss on average.
+
+Still, of course, real-world cases are not that simple: in reality
+we'd need to deal with timeouts, slower remote servers, unexpected
+intermediate results, etc.  DNSSEC-validating resolvers will clearly
+need to do more work.
+
+So, in real-world deployments Q2 should be much smaller than Q1.
+Here are some specific cases of the relationship between Q1 and Q2
+for a given A (assuming r = 0.9):
+
+A = 70%: Q2 = 0.26 * Q1
+A = 60%: Q2 = 0.17 * Q1
+A = 50%: Q2 = 0.11 * Q1
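+
+These ratios can be reproduced by solving the model for Q2/Q1, which
+gives Q2/Q1 = A*(1-r) / (r*(1-A)).  A small illustrative sketch:
+
+  # Q2/Q1 ratio required to reach a target fraction A at hit rate r,
+  # obtained by solving A = r*k / (r*k + (1-r)) for k = Q2/Q1.
+  def required_q2_q1_ratio(a, r=0.9):
+      return a * (1 - r) / (r * (1 - a))
+
+  for a in (0.7, 0.6, 0.5):
+      print(a, round(required_q2_q1_ratio(a), 2))  # 0.26, 0.17, 0.11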
+
+So, even if "recursive resolution is 10 times heavier" than the
+cache-only case, we can assume the server spends about half of its
+run time answering queries directly from the cache at a cache hit
+rate of 90%.  I think this is a reasonably safe assumption.
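+
+(A quick check of that claim under the same model: "10 times heavier"
+means Q2 = 0.1*Q1, so A = 0.9*0.1 / (0.9*0.1 + 0.1) = 0.09/0.19 =
+0.47, which is indeed close to half.)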
+
+Now, assuming a figure of 50% or more, does this suggest we should
+highly optimize the cache?  Opinions may vary on this point, but I
+personally think the answer is yes.  I've written an experimental
+cache-only implementation that employs the idea of fully-rendered
+cached data.  On one test machine (2.20GHz AMD64, using a single
+core), a queryperf-like benchmark shows it can handle over 180Kqps,
+while BIND 9.9.2 can only handle 41Kqps.  The experimental
+implementation skips some features necessary for a production server,
+and cache management itself is an inevitable bottleneck, so the
+production version wouldn't be that fast, but it still suggests it
+may not be very difficult to reach over 100Kqps in a production
+environment, even including recursive resolution overhead.
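+
+As a very rough sanity check of that last claim (not a measurement):
+under the same simplified model, the combined maximum throughput at
+hit rate r is about 1/(r/Q1 + (1-r)/Q2), since each cached answer
+costs about 1/Q1 seconds and each miss about 1/Q2 seconds.  Taking
+Q1 = 180Kqps from the experiment above and assuming Q2 = 0.28*Q1 as
+in the forward-only experiments (both are assumptions borrowed from
+separate experiments, not numbers measured together):
+
+  # Rough combined throughput under the simplified model; the inputs
+  # below are assumptions taken from separate experiments in this memo.
+  def combined_qps(r, q1, q2):
+      return 1.0 / (r / q1 + (1 - r) / q2)
+
+  q1 = 180000          # cache-only throughput of the experimental code
+  q2 = 0.28 * q1       # assumed ratio from the forward-only experiments
+  print(combined_qps(0.9, q1, q2))  # roughly 143K qps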