Lecture Notes on Dynamic Binary Search Trees | CS 373 | Study notes Computer Science

CS 373 Lecture 8: Dynamic Binary Search Trees Fall 2002

Everything was balanced before the computers went off line. Try and

adjust something, and you unbalance something else. Try and adjust

that, you unbalance two more and before you know what’s happened, the

ship is out of control.

— Blake, Blake’s 7, “Breakdown” (March 6, 1978)

A good scapegoat is nearly as welcome as a solution to the problem.

— Anonymous

Let’s play.

— El Mariachi [Antonio Banderas], Desperado (1992)

8 Dynamic Binary Search Trees (February 8)

8.1 Definitions

I’ll assume that everyone is already familiar with the standard terminology for binary search trees—

node, search key, edge, root, internal node, leaf, right child, left child, parent, descendant, sibling,

ancestor, subtree, preorder, postorder, inorder, etc.—as well as the standard algorithms for search-

ing for a node, inserting a node, or deleting a node. Otherwise, see Chapter 12 of CLRS.

For this lecture, we will consider only full binary trees—where every internal node has exactly

two children—where only the internal nodes actually store search keys. In practice, we can represent

the leaves with null pointers.
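
As a rough illustration of this convention, here is a minimal Python sketch (the language and the names Node, search, count_internal, and count_leaves are my own choices for illustration, not part of the notes). Only internal nodes are real objects that store keys; a child equal to None stands in for a leaf.

```python
class Node:
    """An internal node: it stores a search key and always has two children.

    A child equal to None plays the role of a leaf, which stores no key.
    """

    def __init__(self, key, left=None, right=None):
        self.key = key
        self.left = left
        self.right = right


def search(root, key):
    """Standard BST search: return the node storing key, or None if we reach a leaf."""
    v = root
    while v is not None:
        if key == v.key:
            return v
        v = v.left if key < v.key else v.right
    return None


def count_internal(v):
    """Number of key-storing (internal) nodes in the subtree rooted at v."""
    if v is None:
        return 0
    return 1 + count_internal(v.left) + count_internal(v.right)


def count_leaves(v):
    """Number of null leaves hanging off the subtree rooted at v."""
    if v is None:
        return 1
    return count_leaves(v.left) + count_leaves(v.right)


# A three-key example: 2 at the root, 1 and 3 as its children.
root = Node(2, Node(1), Node(3))
print(search(root, 3).key)                        # the key we looked for
print(count_internal(root), count_leaves(root))   # leaves = internal nodes + 1
```

Because every internal node has exactly two children (possibly null), a tree that stores k keys always has k + 1 leaves under this representation.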

Recall that the depth d(v) of a node v is its distance from the root, and its height h(v) is the

distance to the farthest leaf in its subtree. The height (or depth) of the tree is just the height of

the root. The size |v| of v is the number of nodes in its subtree. The size of the whole tree is just

the total number of nodes, which I’ll usually denote by n.
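
These three definitions translate almost directly into code. The sketch below is one possible Python rendering, not the notes' own; it assumes leaves are represented by None, and (an assumption on my part, since the text does not say whether leaves count) it counts only key-storing nodes toward the size. The function names are hypothetical.

```python
class Node:
    """Internal node storing a key; a None child represents a leaf."""

    def __init__(self, key, left=None, right=None):
        self.key = key
        self.left = left
        self.right = right


def height(v):
    """h(v): number of edges from v down to the farthest (null) leaf below it."""
    if v is None:          # a leaf is at distance 0 from itself
        return 0
    return 1 + max(height(v.left), height(v.right))


def size(v):
    """|v|: number of key-storing nodes in the subtree rooted at v (assumption: leaves not counted)."""
    if v is None:
        return 0
    return 1 + size(v.left) + size(v.right)


def depth(root, key):
    """d(v) for the node v storing key: number of edges from the root, or None if key is absent."""
    d, v = 0, root
    while v is not None:
        if key == v.key:
            return d
        v = v.left if key < v.key else v.right
        d += 1
    return None


# Keys 1..5 with 4 at the root; the longest root-to-leaf path goes 4 -> 2 -> 1 -> leaf.
root = Node(4, Node(2, Node(1), Node(3)), Node(5))
print(height(root), size(root), depth(root, 3))
```

With this convention the height of the tree is height(root), and n is size(root).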

A tree with height h has at most 2^h leaves, so the minimum height of an n-leaf binary tree

is ⌈lg n⌉. In the worst case, the time required for a search, insertion, or deletion is proportional to the height

of the tree, so in general we would like to keep the height as close to lg n as possible. The best we

can possibly do is to have a perfectly balanced tree, in which each subtree has as close to half the

leaves as possible, and both subtrees are perfectly balanced. The height of a perfectly balanced tree

is ⌈lg n⌉, so the worst-case search time is O(log n). However, even if we started with a perfectly

balanced tree, a malicious sequence of insertions and/or deletions could make the tree arbitrarily

unbalanced, driving the search time up to Θ(n).
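
To make the two extremes concrete, here is a small Python experiment (a sketch under my own assumptions: leaves are None, insert is the textbook unbalanced insertion, and build_balanced is a hypothetical helper that splits a sorted list at its middle). It builds a perfectly balanced tree on 100 keys, then builds a second tree by inserting the same keys in sorted order, which is one example of a bad insertion sequence.

```python
import math


class Node:
    """Internal node storing a key; a None child represents a leaf."""

    def __init__(self, key, left=None, right=None):
        self.key = key
        self.left = left
        self.right = right


def height(v):
    """Distance from v to the farthest null leaf in its subtree."""
    if v is None:
        return 0
    return 1 + max(height(v.left), height(v.right))


def build_balanced(keys, lo=0, hi=None):
    """Build a perfectly balanced tree from the sorted slice keys[lo:hi]."""
    if hi is None:
        hi = len(keys)
    if lo >= hi:
        return None
    mid = (lo + hi) // 2   # the two sides differ by at most one key
    return Node(keys[mid],
                build_balanced(keys, lo, mid),
                build_balanced(keys, mid + 1, hi))


def insert(root, key):
    """Textbook BST insertion with no rebalancing; returns the (possibly new) root."""
    new = Node(key)
    if root is None:
        return new
    v = root
    while True:
        if key < v.key:
            if v.left is None:
                v.left = new
                return root
            v = v.left
        else:
            if v.right is None:
                v.right = new
                return root
            v = v.right


keys = list(range(1, 101))           # 100 keys, hence 101 null leaves
balanced = build_balanced(keys)

chain = None
for k in keys:                       # sorted insertions: every key goes to the right
    chain = insert(chain, k)

n_leaves = len(keys) + 1
print(height(balanced), math.ceil(math.log2(n_leaves)))  # both are 7
print(height(chain))                                     # 100: one long path
```

The balanced tree meets the ⌈lg n⌉ bound exactly, while the sorted insertions produce a path whose height equals the number of keys, so a search for the largest key walks through every node.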

To avoid this problem, we need to periodically modify the tree to maintain ‘balance’. There

are several methods for doing this, and depending on the method we use, the search tree is given

a different name. Examples include AVL trees, red-black trees, height-balanced trees, weight-

balanced trees, bounded-balance trees, path-balanced trees, B-trees, treaps, randomized binary

search trees, skip lists,[1] and jumplists.[2] Some of these trees support searches, insertions, and

deletions in O(log n) worst-case time, others in O(log n) amortized time, still others in O(log n)

expected time.

In this lecture, I’ll discuss two binary search tree data structures with good amortized perfor-

mance. The first is the scapegoat tree, discovered by Arne Andersson in 1989 and independently by

[1] Yeah, yeah. Skip lists aren’t really binary search trees. Whatever you say, Mr. Picky.

[2] These are essentially randomized variants of the Phobian binary search trees you saw in the first midterm!

[H. Brönnimann, F. Cazals, and M. Durand. Randomized jumplists: A jump-and-walk dictionary data structure.

Manuscript, 2002. http://photon.poly.edu/~hbr/publi/jumplist.html.] So now you know who to blame.


8 Dynamic Binary Search Trees (February 8)

8.1 Definitions

8.2 Lazy Deletions: Global Rebuilding

8.3 Insertions: Partial Rebuilding

8.4 Scapegoat Trees

8.5 Rotations, Double Rotations, and Splaying

8.6 Splay Trees