Typology: Study notes

2021/2022

Uploaded on 09/12/2022

Advanced Databases

Stratis D. Viglas

University of Edinburgh

Introduction Overview

Syllabus

Introduction

Relational databases overview

▻ Data model, evaluation model

Storage

▻ Indexes, multidimensional data

Query evaluation

▻ Join evaluation algorithms, execution models

Query optimisation

▻ Cost models, search space exploration, randomised optimisation

Concurrency control and recovery

▻ Locking and transaction processing

Parallel databases

Assignments and software

Programming assignments

The attica database system

▻ Home-grown RDBMS, written in Java
▻ Visit inf.ed.ac.uk/teaching/courses/adbs/attica to download the system and the API documentation
▻ All programming assignments will be using the attica front-end and code-base

Plagiarism policy: You cheat, you’re caught, you fail

▻ No discussion

Introduction Relational databases overview

Three basic building blocks

Attribute

▻ A (name, value) pair

Tuple

▻ A set of attributes

Relation

▻ A set of tuples with the same schema

Example tuple:

SID   123-ABC
Name  Mary Jones
...   ...
Year  4

Example relation:

SID      Name        ...  Year
123-ABC  Mary Jones  ...  4
456-DEF  John Smith  ...  3
...      ...         ...  ...
999-XYZ  Jack Black  ...  4
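
The three building blocks above can be sketched in a few lines of Python. This is only an illustration, not how attica represents data: the class name `Student` and the field names are assumptions, and the columns elided as "..." in the example are simply left out.

```python
from typing import NamedTuple


class Student(NamedTuple):
    """Schema shared by every tuple of the relation (names are assumed)."""
    sid: str
    name: str
    year: int


# A tuple is a set of attributes.
mary = Student(sid="123-ABC", name="Mary Jones", year=4)

# A relation is a set of tuples with the same schema.
students = {
    mary,
    Student("456-DEF", "John Smith", 3),
    Student("999-XYZ", "Jack Black", 4),
}

# Each attribute is a (name, value) pair.
attributes = list(zip(Student._fields, mary))

# A simple scan over the relation: names of all fourth-year students.
fourth_years = sorted(t.name for t in students if t.year == 4)
```

Using a Python `set` mirrors the definition of a relation as a set of tuples: duplicates collapse and the tuples carry no inherent order.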

Data storage

[Figure: anatomy of a disk drive, labelling page, track, platter, cylinder and drive]

Disk drives are organised in records of 512 bytes.
The DB (and the OS) I/O unit is a disk page (typically 4,096 bytes long).
Pages (and records) are stored on tracks.
Tracks make up a platter (or a disk).
Platters make up a drive.
The same tracks across all platters make up a cylinder.
The disk head (arm) reads the same block of all tracks on all platters.
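
As a quick sanity check on the sizes quoted above, the arithmetic can be written out as follows. The constant and function names are hypothetical, and the 4,096-byte page is the "typical" figure from the slide rather than a fixed rule.

```python
RECORD_BYTES = 512   # disk record size quoted above
PAGE_BYTES = 4096    # typical disk page size quoted above

# How many disk records make up one page.
records_per_page = PAGE_BYTES // RECORD_BYTES


def pages_needed(file_bytes: int) -> int:
    """Whole pages needed to store file_bytes (ceiling division)."""
    return -(-file_bytes // PAGE_BYTES)
```

Under these assumptions a page spans 8 records, and a file occupies a whole number of pages even when its last page is only partly filled.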

A bit of perspective

The dimensions of the head are impressive^1. With a width of less than a hundred nanometers and a thickness of about ten, it flies above the platter at a speed of up to 15,000 RPM, at a height that is the equivalent of 40 atoms. If you start multiplying these infinitesimally small numbers, you begin to get an idea of their significance.

Consider this little comparison: if the read/write head were a Boeing 747, and the hard-disk platter were the surface of the Earth:

▻ The head would fly at Mach 800
▻ At less than one centimeter from the ground
▻ And count every blade of grass
▻ Making fewer than 10 unrecoverable counting errors in an area equivalent to all of Ireland

^1 Source: Matthieu Lamelot, Tom’s Hardware.

Storing tuples

Every disk block contains

▻ A header
▻ Data (i.e., tuples)
▻ Padding (maybe)

Two ways of storing tuples

▻ Either interleave tuples of multiple relations, or
▻ Keep the tuples of the same relation clustered

[Figure: two block layouts. Interleaved tuples: Header | Relation 1 | Relation 2 | Relation 3 | Relation 2 | Relation 3 | Relation 1 | Relation 2 | Relation 3 | Padding. Clustered tuples: Header | Relation 1 in every tuple slot | Padding.]
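
A minimal sketch of the header / data / padding layout for a clustered block, assuming fixed-size tuples. Everything concrete here is an assumption made for illustration: the 4,096-byte block, the two-byte header holding a tuple count, and the (SID, year) tuple format. It is not attica's on-disk format.

```python
import struct

BLOCK_BYTES = 4096               # assumed block size
HEADER = struct.Struct("<H")     # hypothetical header: tuple count (uint16)
TUPLE = struct.Struct("<7sI")    # hypothetical tuple: 7-byte SID + uint32 year


def pack_block(tuples):
    """Lay out the header, then the tuples, then zero padding."""
    body = b"".join(TUPLE.pack(sid.encode(), year) for sid, year in tuples)
    used = HEADER.size + len(body)
    if used > BLOCK_BYTES:
        raise ValueError("tuples do not fit in one block")
    padding = b"\x00" * (BLOCK_BYTES - used)
    return HEADER.pack(len(tuples)) + body + padding


def unpack_block(block):
    """Read the tuple count from the header, then decode that many tuples."""
    (count,) = HEADER.unpack_from(block, 0)
    result = []
    for i in range(count):
        sid, year = TUPLE.unpack_from(block, HEADER.size + i * TUPLE.size)
        result.append((sid.decode(), year))
    return result
```

The padding exists because the space left after the header is rarely an exact multiple of the tuple size, so the tail of the block stays unused.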

Advantages of clustering

Scan a relation of X tuples, Y tuples per block

▻ If unclustered, worst case scenario: read X blocks
▻ Clustered: read X/Y blocks

How about clustering disk blocks?

▻ Reduces unnecessary arm movement

[Figure: unclustered storage versus clustered storage of disk blocks]
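
The scan costs above can be made concrete with a small calculation. The function names are hypothetical, and the clustered cost is rounded up on the assumption that a partly filled last block still costs one read.

```python
import math


def blocks_read_unclustered_worst(x_tuples: int) -> int:
    """Worst case: every tuple sits in a different block."""
    return x_tuples


def blocks_read_clustered(x_tuples: int, y_per_block: int) -> int:
    """Clustered: X / Y blocks, rounded up to whole blocks."""
    return math.ceil(x_tuples / y_per_block)
```

For example, with X = 10,000 tuples and Y = 100 tuples per block, the worst-case unclustered scan reads 10,000 blocks while the clustered scan reads 100.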

What does the buffer manager do?

When a page is requested it:

▻ Checks to see if the page is in the buffer pool; if so, it returns it
▻ If not, it checks whether there is room in the buffer pool; if so, it reads the page in and places it in the available room
▻ If not, it picks a page for replacement; if the page has been “touched”, it writes the page to disk and replaces it
▻ In all three cases, it updates the reference count for the requested page
▻ If necessary, it pins the new page
▻ It returns a handle to the new page
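
The steps above can be sketched as a toy buffer pool. This is a simplified illustration under stated assumptions, not attica's buffer manager: the class and method names are hypothetical, a plain dictionary stands in for the disk, the "handle" is just the page contents, and the victim is simply the first unpinned page found, as a placeholder for a real replacement policy.

```python
class BufferManager:
    """Toy buffer pool following the request steps listed above."""

    def __init__(self, capacity, disk):
        self.capacity = capacity
        self.disk = disk        # page_id -> contents; stands in for the disk
        self.frames = {}        # pages currently in the buffer pool
        self.dirty = set()      # pages that have been "touched"
        self.pinned = set()     # pages that must not be replaced
        self.ref_count = {}     # references per resident page

    def request(self, page_id, pin=False):
        if page_id not in self.frames:
            if len(self.frames) >= self.capacity:
                self._evict()                       # no room: replace a page
            self.frames[page_id] = self.disk[page_id]
        self.ref_count[page_id] = self.ref_count.get(page_id, 0) + 1
        if pin:
            self.pinned.add(page_id)
        return self.frames[page_id]                 # the "handle"

    def write(self, page_id, data):
        self.request(page_id)                       # make sure it is resident
        self.frames[page_id] = data
        self.dirty.add(page_id)

    def unpin(self, page_id):
        self.pinned.discard(page_id)

    def _evict(self):
        candidates = [p for p in self.frames if p not in self.pinned]
        if not candidates:
            raise RuntimeError("all pages are pinned")
        victim = candidates[0]                      # placeholder policy
        if victim in self.dirty:
            self.disk[victim] = self.frames[victim] # write back touched page
            self.dirty.discard(victim)
        del self.frames[victim]
        del self.ref_count[victim]
```

A short usage example: with a pool of two frames, requesting a third page forces out an unpinned page, and a touched victim is written back to the dictionary standing in for the disk before its frame is reused.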

Page replacement

Least recently used (LRU): track when each page was last referenced; replace the page whose last reference is the oldest (usually implemented with a priority queue)

▻ Variant: clock replacement

First In First Out (FIFO)

Most recently used (MRU): the inverse of LRU

Random!
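
To compare the policies, here is a small simulator that counts buffer hits on a page reference trace. It is a sketch under stated assumptions: the function name `count_hits` is hypothetical, LRU is taken to evict the page whose last reference is oldest, MRU the page referenced most recently, and FIFO the page that entered the pool first. Clock and random replacement are not modelled.

```python
from collections import OrderedDict


def count_hits(trace, capacity, policy):
    """Count buffer hits for 'LRU', 'MRU' or 'FIFO' on a reference trace."""
    pool = OrderedDict()            # keys ordered from oldest to newest
    hits = 0
    for page in trace:
        if page in pool:
            hits += 1
            if policy in ("LRU", "MRU"):
                pool.move_to_end(page)   # reorder by last reference
            continue                     # FIFO keeps arrival order
        if len(pool) >= capacity:
            # LRU and FIFO evict the oldest entry, MRU the newest one.
            pool.popitem(last=(policy == "MRU"))
        pool[page] = True
    return hits


# Three sequential scans over four pages with room for only three.
scan_trace = [1, 2, 3, 4] * 3

# One hot page (1) interleaved with pages that are touched once.
hot_trace = [1, 2, 1, 3, 1, 4, 1]
```

On the repeated scan, LRU and FIFO never hit because each page is evicted just before it is needed again, while MRU keeps most of the scan resident. On the trace with a hot page, LRU does better than FIFO. No single policy wins on every access pattern.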

Storage and indexing Overview

Indexing and sorting

Can be summarised as:

▻ Forget whatever you’ve learned about indexing, searching and sorting in main memory (well, almost...)

Remember, we are operating over disk files

▻ The main idea is to minimise disk I/O and not the number of comparisons (i.e., complexity)
▻ Just an idea: comparing two values in memory costs 4.91 · 10^-8 seconds; comparing two values on disk costs 18.2 · 10^-5 seconds (orders of magnitude more expensive)
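
The gap between the two figures quoted above can be checked directly. The constant names are hypothetical; the timings are the ones given in the slide.

```python
import math

MEMORY_COMPARE_SECONDS = 4.91e-8   # comparing two values in memory
DISK_COMPARE_SECONDS = 18.2e-5     # comparing two values on disk

# How many in-memory comparisons fit in the time of one on-disk comparison.
ratio = DISK_COMPARE_SECONDS / MEMORY_COMPARE_SECONDS

# The same gap expressed in orders of magnitude.
orders_of_magnitude = math.log10(ratio)
```

With these numbers a single on-disk comparison costs about as much as 3,700 in-memory ones, which is why the cost that matters is the number of disk I/Os rather than the number of comparisons.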
