Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Well done documents for boards, Summaries of Computer science

Computer science

Well done documents for boards

Typology: Summaries

2022/2023

Uploaded on 01/27/2023

sweta_biswasroy 🇮🇳

3 documents

1 / 39

This page cannot be seen from the preview

Don't miss anything!

Chapter 1

Data Handling

using Pandas

New

syllabus

2022-23

Visit : python.mykvs.in for regular updates

Partial preview of the text

Download Well done documents for boards and more Summaries Computer science in PDF only on Docsity!

Chapter 1 Data Handling using Pandas New syllabus 2022 - 23

Visit : python.mykvs.in for regular updates Visit : python.mykvs.in for regular updates Visit : python.mykvs.in for regular updates Python Library – Pandas It is a most famous Python package for data science, which offers powerful and flexible data structures that make data analysis and manipulation easy.Pandas makes data importing and data analyzing much easier. Pandas builds on packages like NumPy and matplotlib to give us a single & convenient place for data analysis and visualization work.

Basic Features of Pandas

Dataframe object help a lot in keeping track of our data.
With a pandas dataframe, we can have different data types (float, int, string, datetime, etc) all in one place
Pandas has built in functionality for like easy grouping & easy joins of data, rolling windows
Good IO capabilities; Easily pull data from a MySQL database directly into a data frame
With pandas, you can use patsy for R-style syntax in doing regressions.
Tools for loading data into in-memory data objects from different file formats.
Data alignment and integrated handling of missing data.
Reshaping and pivoting of data sets.
Label-based slicing, indexing and subsetting of large data sets.

Visit : python.myks.in for regular updates Pandas – Installation/Environment Setup Pandas module doesn't come bundled with Standard Python. If we install Anaconda Python package Pandas will be installed by default. Steps for Anaconda installation & Use

1. visit the site https://www.anaconda.com/download/

Download appropriate anaconda installer
After download install it.
During installation check for set path and all user
After installation start spyder utility of anaconda from start menu

6. Type import pandas as pd in left pane(temp.py)

Then run it.
If no error is show then it shows pandas is installed.
Like default temp.py we can create another .py file from new window option of file menu for new program.

Pandas – Installation/Environment Setup 4.Now move to script folder of python distribution in command prompt (through cmd command of windows).

Execute following commands in command prompt serially.

pip install numpy pip install six pip install pandas Wait after each command for installation Now we will be able to use pandas in standard python distribution.

6. Type import pandas as pd in python (IDLE) shell.

If it executed without error(it means pandas is installed on your system)

Data Structures in Pandas Two important data structures of pandas are–Series, DataFrame

Series Series is like a one-dimensional array like structure with homogeneous data. For example, the following series is a collection of integers. Basic feature of series are ❖ Homogeneous data ❖ Size Immutable ❖ Values of Data Mutable

Pandas Series It is like one-dimensional array capable of holding data of any type (integer, string, float, python objects, etc.). Series can be created using constructor. Syntax :- pandas.Series( data, index, dtype, copy) Creation of Series is also possible from – ndarray, dictionary, scalar value. Series can be created using

Array
Dict
Scalar value or constant

Pandas Series Create an Empty Series e.g. import pandas as pseries s = pseries.Series() print(s) Output Series([], dtype: float64)

Pandas Series Create a Series from dict Eg.1(without index) import pandas as pd import numpy as np data = {'a' : 0., 'b' : 1., 'c' : 2.} s = pd1.Series(data) print(s) Output a 0. b 1. c 2. dtype: float Eg.2 (with index) import pandas as pd import numpy as np data = {'a' : 0., 'b' : 1., 'c' : 2.} s = pd1.Series(data,index=['b','c','d','a']) print(s) Output b 1. c 2. d NaN a 0. dtype: float

Create a Series from Scalar e.g import pandas as pd import numpy as np s = pd1.Series(5, index=[0, 1, 2, 3]) print(s) Output 0 5 1 5 2 5 3 5 dtype: int Note :- here 5 is repeated for 4 times (as per no of index)

Pandas Series Head function e.g import pandas as pd s = pd1.Series([1,2,3,4,5],index = ['a','b','c','d','e']) print (s.head(3)) Output a 1 b. 2 c. 3 dtype: int Return first 3 elements

Pandas Series tail function e.g import pandas as pd s = pd1.Series([1,2,3,4,5],index = ['a','b','c','d','e']) print (s.tail(3)) Output c 3 d. 4 e. 5 dtype: int Return last 3 elements

Pandas Series Retrieve Data Using Label as (Index) e.g. import pandas as pd s = pd1.Series([1,2,3,4,5],index = ['a','b','c','d','e']) print (s[['c','d']]) Output c 3 d 4 dtype: int

Well done documents for boards, Summaries of Computer science

Related documents

Partial preview of the text

Download Well done documents for boards and more Summaries Computer science in PDF only on Docsity!

1. visit the site https://www.anaconda.com/download/

6. Type import pandas as pd in left pane(temp.py)

6. Type import pandas as pd in python (IDLE) shell.

There are three methods for data selection:

▪ loc gets rows (or columns) with particular labels from

the index.

▪ iloc gets rows (or columns) at particular positions in

the index (so it only takes integers).

▪ ix usually tries to behave like loc but falls back to

behaving like iloc if a label is not present in the index.

ix is deprecated and the use of loc and iloc is encouraged

instead