








Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Various strategies for joining and indexing large relational databases using different algorithms and index types. Topics include selection, projection, join algorithms, and set operations. The document also provides cost analyses for different scenarios and discusses the trade-offs between various indexing strategies.
Typology: Study notes
1 / 14
This page cannot be seen from the preview
Don't miss anything!









foreach tuple r in
do foreach tuple s in
do if r.sid
s.sid then add <r, s> to result Cost
Should
be the Smaller/Larger relation?
Consider the join
(R.a=S.b)
Ignore the cost of writing out the result. Relation
contains
tuples and has
tuples per page. Relation
contains
tuples and also has
tuples per page. Attribute b of relation
is the primary key for
Both relations are stored as simple heap files. Neither relation has any indexes built on it. 52 buffer pages are available.
What is the cost of joining
and
using a page
oriented simple nested loops join? What is the minimum number of buffer pages required for this cost to remain unchanged?
What is the cost of joining
and
using a block nested loops join? What is the minimum number of buffer pages required for this cost to remain unchanged?
What is the cost of joining
and
using a sort ‐merge join?
What is the cost of joining
and
using a hash join?
What would be the lowest possible
cost for joining
and
using any join algorithm, and how much buffer space would be needed to achieve this cost?
How many tuples does the join of
and
produce, at most?
th