Hash Join Performance - Computer and Information Science - Lecture Slides, Slides of Applications of Computer Sciences

Banasthali Vidyapith Applications of Computer Sciences

These lecture slides on computer and information science cover the following topics: Hash Join Performance, Performance Through Prefetching, Proposed Techniques, Experimental Setup, Excessive Random, Relational Database Management System, Random Access Patterns, Cache Prefetching, Hash Join Code

Typology: Slides

2012/2013

Uploaded on 04/24/2013

bandhula 🇮🇳


Improving Hash Join Performance

through Prefetching

Outline

____________________________

Overview
Proposed Techniques
Experimental setup
Performance evaluation
Conclusion

Hash Join Performance

________________________________

Hash joins suffer from CPU cache stalls:

- Most of the execution time is wasted on data cache misses
- 82% for the partition phase, 73% for the join phase
- The cause is the random access patterns in memory

Solution: Cache Prefetching

___________________________

- Cache prefetching has been successfully applied to several types of applications.
- The goal here is to exploit cache prefetching to improve hash join performance.

Overcoming These Challenges

Evaluate two new prefetching techniques:

- Group prefetching: try to hide cache miss latency across a group of tuples
- Software-pipelined prefetching: avoid these intermittent stalls

Group Prefetching

- Hide cache miss latency across a group of tuples.
- Combine the processing of a group of tuples into a single loop body and rearrange the probe operations into stages.
- Process all the tuples in the group for one stage, then move to the next stage.
- Add prefetch instructions to the algorithm: issue prefetches in one code stage for the memory references in the next code stage.
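
As a rough illustration of the staging described above, here is a minimal C sketch of a group-prefetched probe loop. The slides do not show the actual code, so everything here is an assumption made for the example: the chained hash table layout, the names `Node`, `HashTable`, `hash_key`, `ht_insert` and `probe_group`, the group size of 8, and the use of the GCC/Clang builtin `__builtin_prefetch` (replaced by a no-op on other compilers).

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__GNUC__) || defined(__clang__)
#define PREFETCH(p) __builtin_prefetch((p))
#else
#define PREFETCH(p) ((void)(p))   /* no-op fallback */
#endif

#define GROUP_SIZE 8u   /* hypothetical; in practice tuned to the memory latency */

typedef struct Node {   /* hypothetical build-side entry in a chained bucket */
    uint32_t key;
    uint32_t payload;
    struct Node *next;
} Node;

typedef struct {
    Node **buckets;     /* array of bucket headers (chain heads) */
    size_t nbuckets;    /* assumed to be a power of two */
} HashTable;

size_t hash_key(const HashTable *ht, uint32_t key) {
    /* Multiplicative hash, masked down to the bucket count. */
    return (size_t)(key * 2654435761u) & (ht->nbuckets - 1);
}

void ht_insert(HashTable *ht, Node *n) {
    size_t b = hash_key(ht, n->key);
    n->next = ht->buckets[b];
    ht->buckets[b] = n;
}

/* Probe n keys, one group at a time; returns the number of matching pairs. */
size_t probe_group(const HashTable *ht, const uint32_t *keys, size_t n) {
    assert(ht->nbuckets > 0 && (ht->nbuckets & (ht->nbuckets - 1)) == 0);
    size_t matches = 0;
    for (size_t base = 0; base < n; base += GROUP_SIZE) {
        size_t g = (n - base < GROUP_SIZE) ? n - base : GROUP_SIZE;
        size_t slot[GROUP_SIZE];
        const Node *cur[GROUP_SIZE];
        size_t live = 0;

        /* Stage 0: hash every key in the group, prefetch its bucket header. */
        for (size_t i = 0; i < g; i++) {
            slot[i] = hash_key(ht, keys[base + i]);
            PREFETCH(&ht->buckets[slot[i]]);
        }
        /* Stage 1: read the bucket headers, prefetch the first chain node. */
        for (size_t i = 0; i < g; i++) {
            cur[i] = ht->buckets[slot[i]];
            if (cur[i]) { PREFETCH(cur[i]); live++; }
        }
        /* Stage 2, repeated: visit one node per tuple, prefetch the next. */
        while (live > 0) {
            live = 0;
            for (size_t i = 0; i < g; i++) {
                if (!cur[i]) continue;
                if (cur[i]->key == keys[base + i]) matches++;
                cur[i] = cur[i]->next;
                if (cur[i]) { PREFETCH(cur[i]); live++; }
            }
        }
    }
    return matches;
}
```

Each stage touches only memory that the previous stage asked for, so the miss for one tuple can overlap with work on the other tuples in the group. A real join would emit output tuples at the match point rather than count them.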

Group vs. Software-Pipelined Prefetching

Hiding latency:

- Software-pipelined prefetching is always able to hide all latencies.

Book-keeping overhead:

- Software-pipelined prefetching has more overhead.

Code complexity:

- Group prefetching is easier to implement.
- The natural group boundary provides a place to do any remaining processing (e.g. for read-write conflicts).
- The group boundary is also a natural place to send outputs to the parent operator if a pipelined operator is needed.
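
To make the book-keeping difference concrete, here is a simplified C sketch of a software-pipelined probe loop. It is not the slides' code: the table layout, the names `Node`, `HashTable`, `hash_key`, `ht_insert` and `probe_pipelined`, the prefetch distance `DIST`, and the ring size `RING` are assumptions for illustration, and `__builtin_prefetch` is a GCC/Clang builtin. As a further simplification, only the bucket header and the first chain node are prefetched; the rest of a chain is walked without prefetching.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__GNUC__) || defined(__clang__)
#define PREFETCH(p) __builtin_prefetch((p))
#else
#define PREFETCH(p) ((void)(p))   /* no-op fallback */
#endif

#define DIST 4u    /* hypothetical prefetch distance between stages */
#define RING 16u   /* per-tuple state slots; must be >= 2 * DIST + 1 */

typedef struct Node {
    uint32_t key;
    struct Node *next;
} Node;

typedef struct {
    Node **buckets;    /* array of bucket headers (chain heads) */
    size_t nbuckets;   /* assumed to be a power of two */
} HashTable;

size_t hash_key(const HashTable *ht, uint32_t key) {
    return (size_t)(key * 2654435761u) & (ht->nbuckets - 1);
}

void ht_insert(HashTable *ht, Node *n) {
    size_t b = hash_key(ht, n->key);
    n->next = ht->buckets[b];
    ht->buckets[b] = n;
}

/* Probe n keys with a three-stage pipeline; returns the number of matches. */
size_t probe_pipelined(const HashTable *ht, const uint32_t *keys, size_t n) {
    assert(ht->nbuckets > 0 && (ht->nbuckets & (ht->nbuckets - 1)) == 0);
    size_t slot[RING];         /* book-keeping: bucket index per in-flight tuple */
    const Node *head[RING];    /* book-keeping: chain head per in-flight tuple */
    size_t matches = 0;

    /* 2 * DIST extra iterations drain the last tuples through the pipeline. */
    for (size_t j = 0; j < n + 2 * DIST; j++) {
        /* Stage 0 for tuple j: hash the key, prefetch the bucket header. */
        if (j < n) {
            slot[j % RING] = hash_key(ht, keys[j]);
            PREFETCH(&ht->buckets[slot[j % RING]]);
        }
        /* Stage 1 for tuple j - DIST: read the header, prefetch the node. */
        if (j >= DIST && j - DIST < n) {
            size_t t = j - DIST;
            head[t % RING] = ht->buckets[slot[t % RING]];
            if (head[t % RING]) PREFETCH(head[t % RING]);
        }
        /* Stage 2 for tuple j - 2 * DIST: walk the chain and compare keys. */
        if (j >= 2 * DIST) {
            size_t t = j - 2 * DIST;
            for (const Node *p = head[t % RING]; p; p = p->next)
                if (p->key == keys[t]) matches++;
        }
    }
    return matches;
}
```

Every iteration does a little work for three different tuples, so there is no group boundary at which the pipeline empties. The price is the ring buffers and index arithmetic, which is the extra book-keeping the comparison refers to.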

Experimental Setup

- Use a simple schema for both the build and probe relations.
- Every tuple contains a 4-byte join attribute and a fixed-length payload.
- Perform the join without selections and projections.
- Assume the join phase uses 50 MB of memory to join a pair of build and probe partitions.
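
For a sense of scale, the tuple layout above could be declared as follows in C. The slides do not give the payload length, so `PAYLOAD_BYTES` is a purely hypothetical value, as are the names `Tuple` and `tuples_in_budget`; "50 MB" is read here as 50 × 1024 × 1024 bytes, which is also an assumption.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define PAYLOAD_BYTES 96u                         /* hypothetical payload length */
#define JOIN_MEMORY_BYTES (50u * 1024u * 1024u)   /* assumed meaning of "50 MB" */

typedef struct {
    uint32_t join_key;                      /* the 4-byte join attribute */
    unsigned char payload[PAYLOAD_BYTES];   /* fixed-length payload */
} Tuple;

/* Upper bound on how many tuples of a given size fit in a memory budget. */
size_t tuples_in_budget(size_t budget_bytes, size_t tuple_bytes) {
    assert(tuple_bytes > 0);
    return budget_bytes / tuple_bytes;
}
```

With these assumed sizes a tuple occupies 100 bytes, so `tuples_in_budget(JOIN_MEMORY_BYTES, sizeof(Tuple))` gives 524288. This is only an upper bound, since the hash table itself must also fit within the same budget.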

Performance Evaluation (cont.)

User-Mode CPU Cache Performance

Join Phase Performance

- This technique achieved 3.02x to 4.04x speedups over the original hash join.

Performance Evaluation (cont.)

Join Performance with Varying Memory Latency

- The prefetching techniques remain effective even when the processor/memory speed gap increases dramatically.

Conclusion

- Even though prefetching is a promising technique for improving CPU cache performance, applying it to the hash join algorithm is not straightforward, due to the dependencies within the processing of a single tuple and the randomness of hashing.
- Experimental results demonstrated that hash join performance can be improved by using the group prefetching and software-pipelined prefetching techniques.
- Several practical issues arise when these techniques are used in a DBMS that targets multiple architectures.