Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Z
ZODB
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
Analytics
Analytics
Repository
Value Stream
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Commits
Issue Boards
Open sidebar
Nicolas Wavrant
ZODB
Commits
04d1d290
Commit
04d1d290
authored
Jul 26, 2005
by
Tim Peters
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
More words; update examples w/ new output.
parent
b2d53a75
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
52 additions
and
29 deletions
+52
-29
doc/ZEO/trace.txt
doc/ZEO/trace.txt
+50
-27
src/ZEO/simul.py
src/ZEO/simul.py
+2
-2
No files found.
doc/ZEO/trace.txt
View file @
04d1d290
...
@@ -46,11 +46,11 @@ around 70% are good. 90% is excellent. If you see a hit rate under 60% you
...
@@ -46,11 +46,11 @@ around 70% are good. 90% is excellent. If you see a hit rate under 60% you
can probably improve the cache performance (and hence your Zope application
can probably improve the cache performance (and hence your Zope application
server's performance) by increasing the ZEO cache size. This is normally
server's performance) by increasing the ZEO cache size. This is normally
configured using key ``cache_size`` in the ``zeoclient`` section of your
configured using key ``cache_size`` in the ``zeoclient`` section of your
configuration file. The default cache size is 20 MB, which is
very
small.
configuration file. The default cache size is 20 MB, which is small.
The stats.py tool shows its command line syntax when invoked without
The stats.py tool shows its command line syntax when invoked without
arguments. The tracefile argument can be a gzipped file if it has a .gz
arguments. The tracefile argument can be a gzipped file if it has a .gz
extension. It will read from stdin (assuming uncompressed data) if the
extension. It will
be
read from stdin (assuming uncompressed data) if the
tracefile argument is '-'.
tracefile argument is '-'.
Simulating Different Cache Sizes
Simulating Different Cache Sizes
...
@@ -66,36 +66,59 @@ statistics is added at the end.
...
@@ -66,36 +66,59 @@ statistics is added at the end.
Example, assuming the trace file is in /tmp/cachetrace.log::
Example, assuming the trace file is in /tmp/cachetrace.log::
$ python simul.py -s 100 /tmp/cachetrace.log
$ python simul.py -s 4 /tmp/cachetrace.log
START TIME DURATION LOADS HITS INVALS WRITES FLIPS HITRATE
CircularCacheSimulation, cache size 4,194,304 bytes
Sep 4 11:59 38:01 59833 40473 257 20 2 67.6%
START TIME DURATION LOADS HITS INVALS WRITES HITRATE EVICTS INUSE
$
Jul 22 22:22 39:09 3218856 1429329 24046 41517 44.4% 40776 99.8
This shows that with a 4 MB cache size, the cache hit rate is 44.4%, the
percentage 1429329 (number of cache hits) is of 3218856 (number of load
requests). The cache simulated 40776 evictions, to make room for new object
states. At the end, 99.8% of the bytes reserved for the cache file were in
use to hold object state (the remaining 0.2% consists of "holes", bytes freed
by object eviction and not yet reused to hold another object's state).
This shows that with a 100 MB cache size, the cache hit rate is
Let's try this again with an 8 MB cache::
67.6%. So let's try this again with a 200 MB cache size::
$ python simul.py -s
200
/tmp/cachetrace.log
$ python simul.py -s
8
/tmp/cachetrace.log
START TIME DURATION LOADS HITS INVALS WRITES FLIPS HITRATE
CircularCacheSimulation, cache size 8,388,608 bytes
Sep 4 11:59 38:01 59833 40921 258 20 1 68.4%
START TIME DURATION LOADS HITS INVALS WRITES HITRATE EVICTS INUSE
$
Jul 22 22:22 39:09 3218856 2182722 31315 41517 67.8% 40016 100.0
This showed hardly any improvement. So let's try a 300 MB cache
That's a huge improvement in hit rate, which isn't surprising since these are
size::
very small cache sizes. The default cache size is 20 MB, which is still on
the small side::
$ python2.0 simul.py -s 300 /tmp/cachetrace.log
$ python simul.py /tmp/cachetrace.log
ZEOCacheSimulation, cache size 300,000,000 bytes
CircularCacheSimulation, cache size 20,971,520 bytes
START TIME DURATION LOADS HITS INVALS WRITES FLIPS HITRATE
START TIME DURATION LOADS HITS INVALS WRITES HITRATE EVICTS INUSE
Sep 4 11:59 38:01 59833 40921 258 20 0 68.4%
Jul 22 22:22 39:09 3218856 2982589 37922 41517 92.7% 37761 99.9
$
This shows that for this particular trace file, the maximum attainable
Again a very nice improvement in hit rate, and there's not a lot of room left
hit rate is 68.4%. This is probably caused by the fact that nearly a
for improvement. Let's try 100 MB::
third of the objects mentioned in the trace were loaded only once --
the cache only helps if an object is loaded more than once.
The simul.py tool also supports simulating different cache
$ python simul.py -s 100 /tmp/cachetrace.log
strategies. Since none of these are implemented, these are not
CircularCacheSimulation, cache size 104,857,600 bytes
further documented here.
START TIME DURATION LOADS HITS INVALS WRITES HITRATE EVICTS INUSE
Jul 22 22:22 39:09 3218856 3218741 39572 41517 100.0% 22778 100.0
It's very unusual to see a hit rate so high. The application here frequently
modified a very large BTree, so given enough cache space to hold the entire
BTree it rarely needed to ask the ZEO server for data: this application
reused the same objects over and over.
More typical is that a substantial number of objects will be referenced only
once. Whenever an object turns out to be loaded only once, it's a pure loss
for the cache: the first (and only) load is a cache miss; storing the object
evicts other objects, possibly causing more cache misses; and the object is
never loaded again. If, for example, a third of the objects are loaded only
once, it's quite possible for the theoretical maximum hit rate to be 67%, no
matter how large the cache.
The simul.py script also contains code to simulate different cache
strategies. Since none of these are implemented, and only the default cache
strategy's code has been updated to be aware of MVCC, these are not further
documented here.
Simulation Limitations
Simulation Limitations
----------------------
----------------------
...
@@ -118,4 +141,4 @@ exact simulation:
...
@@ -118,4 +141,4 @@ exact simulation:
requests for O will continue to be simulated cache misses, although in a
requests for O will continue to be simulated cache misses, although in a
real cache they'll likely be cache hits. On the other hand, the
real cache they'll likely be cache hits. On the other hand, the
simulated cache doesn't need to evict any objects to make room for O, so it
simulated cache doesn't need to evict any objects to make room for O, so it
may enjoy further cache hits on objects a real cache would
need to evict
.
may enjoy further cache hits on objects a real cache would
have evicted
.
src/ZEO/simul.py
View file @
04d1d290
...
@@ -299,7 +299,7 @@ class CircularCacheEntry(object):
...
@@ -299,7 +299,7 @@ class CircularCacheEntry(object):
from
ZEO.cache
import
ZEC3_HEADER_SIZE
from
ZEO.cache
import
ZEC3_HEADER_SIZE
class
CircularCacheSimulation
(
Simulation
):
class
CircularCacheSimulation
(
Simulation
):
"""Simulate the ZEO 3.0
a
cache."""
"""Simulate the ZEO 3.0 cache."""
# The cache is managed as a single file with a pointer that
# The cache is managed as a single file with a pointer that
# goes around the file, circularly, forever. New objects
# goes around the file, circularly, forever. New objects
...
@@ -572,7 +572,7 @@ class CircularCacheSimulation(Simulation):
...
@@ -572,7 +572,7 @@ class CircularCacheSimulation(Simulation):
# CAUTION: It's most likely that none of the simulators below this
# CAUTION: It's most likely that none of the simulators below this
# point work anymore. A great many changes were needed to teach
# point work anymore. A great many changes were needed to teach
# CircularCacheSimulation (above) about MVCC, including method signature
# CircularCacheSimulation (above) about MVCC, including method signature
# changes and changes in cache file format, and none of the other
s
simulator
# changes and changes in cache file format, and none of the other simulator
# classes were changed.
# classes were changed.
#############################################################################
#############################################################################
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment