Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Z
ZODB
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Analytics
Analytics
CI / CD
Repository
Value Stream
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Kirill Smelkov
ZODB
Commits
2fb72349
Commit
2fb72349
authored
Jun 01, 2002
by
Tim Peters
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Clarified (I hope <wink>) BTREE_SEARCH's correctness proof.
parent
1d88ddbc
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
31 additions
and
25 deletions
+31
-25
src/BTrees/Maintainer.txt
src/BTrees/Maintainer.txt
+31
-25
No files found.
src/BTrees/Maintainer.txt
View file @
2fb72349
...
@@ -192,48 +192,54 @@ BTree nodes.)
...
@@ -192,48 +192,54 @@ BTree nodes.)
When searching for a key k, then, the child pointer we want to follow
When searching for a key k, then, the child pointer we want to follow
is the one at index i such that K(i) <= k < K(i+1). There can be
is the one at index i such that K(i) <= k < K(i+1). There can be
only one such i, since the keys are strictly increasing. And there is
at most one such i, since the K(i) are strictly increasing. And there
at *least* one such i provided the tree isn't empty. For the moment,
is at least one such i provided the tree isn't empty (so that 0 < len).
assume the tree isn't empty (we'll get back to that later).
For the moment, assume the tree isn't empty (we'll get back to that
later).
The macro's chief loop invariant is
The macro's chief loop invariant is
K(lo) < k < K(hi)
K(lo) < k < K(hi)
This holds trivially at the start, since lo is set to 0
ah
d hi to
This holds trivially at the start, since lo is set to 0
, an
d hi to
x->len, and we pretend K(0) is minus infinity and K(len) is plus
x->len, and we pretend K(0) is minus infinity and K(len) is plus
infinity. Inside the loop, if K(i) < k we set lo to i, and if
infinity. Inside the loop, if K(i) < k we set lo to i, and if
K(i) > k we set hi to i. These obviously preserve the invariant.
K(i) > k we set hi to i. These obviously preserve the invariant.
If K(i) == k, the loop breaks and sets the result to i, and since
If K(i) == k, the loop breaks and sets the result to i, and since
K(i) == k in that case i is obviously the correct result.
K(i) == k in that case i is obviously the correct result.
What if the key isn't present? lo and hi move toward each other,
Other cases depend on how i = floor((lo + hi)/2) works, exactly.
narrowing the range, until eventually lo+1 == hi. At that point,
Suppose lo + d = hi for some d >= 0. Then i = floor((lo + lo + d)/2) =
i = (lo+hi)/2 = (lo+lo+1)/2 = lo + 1/2 = lo, so that
:
floor(lo + d/2) = lo + floor(d/2). So
:
1. The loop's "i > lo" test is false, so the loop ends then.
a. [d == 0] (lo == i == hi) if and only if (lo == hi).
b. [d == 1] (lo == i < hi) if and only if (lo+1 == hi).
c. [d > 1] (lo < i < hi) if and only if (lo+1 < hi).
and
If the node is empty (x->len == 0), then lo==i==hi==0 at the start,
and the loop exits immediately (the first "i > lo" test fails),
without entering the body.
2. The invariant still holds, so K(i) < k < K(i+1), and i is again
Else lo < hi at the start, and the invariant K(lo) < k < K(hi) holds.
the correct answer.
Can we get out of the loop too early? No: if hi = lo + d for some d
If lo+1 < hi, we're in case #c: i is strictly between lo and hi,
greater than 1, then i = (lo+lo+d)/2 = lo + d/2, and d/2 is at least 1
so the loop body is entered, and regardless of whether the body sets
since d is at least 2: i is strictly greater than lo then, and the
the new lo or the new hi to i, the new lo is strictly less than the
loop continues.
new hi, and the difference between the new lo and new hi is strictly
less than the difference between the old lo and old hi. So long as
the new lo + 1 remains < the new hi, we stay in this case. We can't
stay in this case forever, though: because hi-lo decreases on each
trip but remains > 0, lo+1 == hi must eventually become true. (In
fact, it becomes true quickly, in about log2(x->len) trips; the
point is more that lo doesn't equal hi when the loop ends, it has to
end with lo+1==hi and i==lo).
Can lo==hi? Yes, but only if the node is empty. Then i, lo and hi
Then we're in case #b: i==lo==hi-1 then, and the loop exits. The
all start out as 0, and the loop exits immediately. If the loop
invariant still holds, with lo==i and hi==lo+1==i+1:
isn't empty, then lo and hi start out with different values. Whenever
lo and hi have different values, lo <= (lo + hi)/2 < hi, so i and lo
are strictly smaller than hi, so setting either lo or hi to i leaves
the new lo strictly smaller than the new hi.
Can the loop fail to terminate? No: by the above, when lo < hi-1,
K(i) < k < K(i+1)
lo < i=(lo+hi)/2 < hi, so setting either lo or hi to i leaves the
new lo and hi strictly closer to each other than were the old lo and
so i is again the correct answer.
hi.
Optimization points:
Optimization points:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment