Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Red Wine Quality on Machine Learning, Study Guides, Projects, Research of Machine Learning

Panjab University Machine Learning

It is machine learning project which checks the quality of red wine.

Typology: Study Guides, Projects, Research

2018/2019

Uploaded on 12/16/2019

deepshikha-dogra 🇮🇳

4.3

(3)

1 document

1 / 23

This page cannot be seen from the preview

Don't miss anything!

A REPORT ON

RED WINE QUALITY

ThinkNEXT Technologies Private Lmited

SUBMITTED IN PARTIAL FULFILLMENT FOR AWARD DEGREE OF

BACHELOR OF ENGINEERING

COMPUTER SCIENCE AND ENGINEERING

DEEPSHIKHA

(SG16312)

7th SEM.

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

UIET, PANJAB UNIVERSITY SSG REGIONAL CENTRE,

HOSHIARPUR-146021, Punjab (INDIA)

(2019)

Discover Study Guides, Projects, Research of Machine Learning Panjab University

Partial preview of the text

Download Red Wine Quality on Machine Learning and more Study Guides, Projects, Research Machine Learning in PDF only on Docsity!

A REPORT ON

RED WINE QUALITY

AT

ThinkNEXT Technologies Private Lmited

SUBMITTED IN PARTIAL FULFILLMENT FOR AWARD DEGREE OF

BACHELOR OF ENGINEERING

IN

COMPUTER SCIENCE AND ENGINEERING

BY

DEEPSHIKHA

(SG16312)

7th SEM. DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

UIET, PANJAB UNIVERSITY SSG REGIONAL CENTRE,

HOSHIARPUR-146021, Punjab (INDIA)

DECLARATION

The work embodied in the training report entitled, “Red Wine Quality” submitted to the department of Computer Science and Engineering at UIET, Panjab University Swami Sarvanand Giri Regional Centre, Hoshiarpur for the award of degree of Bachelor of Engineering, has been done by me. The training report is entirely based on my own work and not submitted elsewhere for the award of any other degree. All ideas and references have been duly acknowledged. Deepshikha Countersigned by: (Supervisor)

ABSTRACT

This project is based on the application of Logistic Regression. This project checks the quality of red wine. It trains the data model and then predicts the output. It also provides the accuracy of the predicted data. The dataset for model is downloaded from www.kaggle.com. Our project includes

Training the data
Predicts the output based on trained data
Provides Accuracy score

_____________________________________________________________________

 - 1 Declaration S.No. Title Page No. - 2 Acknowledgement - 3 Abstract

Chapter 1: TOOLS USED 6- - 1.1 Operating System - 1.2 Anaconda Navigator - 1.3 Jupyter Notebook
Chapter 2: TECHNOLOGY USED 11- - 2.1 : Python - 2.2 : Machine Learning - 2.3 : Logistic Regression
CHAPTER 3: INTRODUCTION TO PROJECT 16- - 3.1 : RED WINE QUALITY - 3.2 : DEPENDENCIES - 3.3 : APPROACH - 4. CONCLUSION - 5. FUTURE SCOPE
- 1. BIBLIOGRAPHY

2. ANACONDA NAVIGATOR

Anaconda Navigator is a desktop graphical user interface (GUI) included in Anaconda® distribution that allows you to launch applications and easily manage conda packages, environments, and channels without using command-line commands. Navigator can search for packages on Anaconda Cloud or in a local Anaconda Repository. It is available for Windows, macOS, and Linux. To get Navigator, get the Navigator Cheat Sheet and install Anaconda. The Getting started with Navigator section shows how to start Navigator from the shortcuts or from a terminal window.

Why use Navigator?

In order to run, many scientific packages depend on specific versions of other packages. Data scientists often use multiple versions of many packages and use multiple environments to separate these different versions.

The command-line program conda is both a package manager and an environment manager. This helps data scientists ensure that each version of each package has all the dependencies it requires and works correctly. Navigator is an easy, point-and-click way to work with packages and environments without needing to type conda commands in a terminal window. You can use it to find the packages you want, install them in an environment, run the packages, and update them – all inside Navigator.

What applications can be accessed?

The following applications are available by default in Navigator:  JupyterLab  Jupyter Notebook  Spyder  VSCode  Glueviz  Orange 3 App

TECHNOLOGY USED

1. PYTHON

Python is an interpreted, high-level, general-purpose programming language. Created by Guido van Rossum and first released in 1991, Python's design philosophy emphasizes code readability with its notable use of significant whitespace. Its language constructs and object-oriented approach aim to help programmers write clear, logical code for small and large-scale projects. Python is dynamically typed and garbage-collected. It supports multiple programming paradigms, including procedural, object-oriented, and functional programming. Python is often described as a "batteries included" language due to its comprehensive standard library. Most Python implementations (including CPython) include a read–eval–print loop (REPL), permitting them to function as a command line interpreter for which the user enters statements sequentially and receives results immediately. Other shells, including IDLE and IPython, add further abilities such as auto-completion, session state retention and syntax highlighting. As well as standard desktop integrated development environments, there are Web browser-based IDEs; SageMath (intended for developing science and math-related Python programs); PythonAnywhere, a browser-based IDE and hosting environment; and Canopy IDE, a commercial Python IDE emphasizing scientific computing.

What is Unsupervised Learning? The model learns through observation and finds structures in the data. Once the model is given a dataset, it automatically finds patterns and relationships in the dataset by creating clusters in it. What it cannot do is add labels to the cluster, like it cannot say this a group of apples or mangoes, but it will separate all the apples from mangoes. Suppose we presented images of apples, bananas and mangoes to the model, so what it does, based on some patterns and relationships it creates clusters and divides the dataset into those clusters. Now if a new data is fed to the model, it adds it to one of the created clusters. What is Reinforcement Learning? It is the ability of an agent to interact with the environment and find out what is the best outcome. It follows the concept of hit and trial method. The agent is rewarded or penalized with a point for a correct or a wrong answer, and on the basis of the positive reward points gained the model trains itself. And again once trained it gets ready to predict the new data presented to it.

How ML works?

3. LOGISTIC REGRESSION

What is Regression? Regression analysis is a powerful statistical analysis technique. A dependent variable of our interest is used to predict the values of other independent variables in a data-set. We come across regression in an intuitive way all the time. Like predicting the weather using the data-set of the weather conditions in the past. It uses many techniques to analyse and predict the outcome, but the emphasis is mainly on relationship between dependent variable and one or more independent variable. Logistic regression analysis predicts the outcome in a binary variable which has only two possible outcomes. Logistic Regression In Python It is a technique to analyse a data-set which has a dependent variable and one or more independent variables to predict the outcome in a binary variable, meaning it will have only two outcomes. The dependent variable is categorical in nature. Dependent variable is also referred as target variable and the independent variables are called the predictors.

The task here is to predict the quality of red wine on a scale of 0–10 given a set of features as inputs. I have solved it as a regression problem using Logistic Regression. Input variables are fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulfur dioxide, total sulfur dioxide, density, pH, sulphates, alcohol. And the output variable (based on sensory data) is quality (score between 0 and 10). Dependencies The code is in python. Other than this, I installed the following libraries using pip.

Pandas: pip install pandas
matplotlib: pip install matplotlib
numpy: pip install numpy

Approach

Import all the required libraries.
Load the dataset.
Check the missing information.

Perform analysis

Calculate accuracy score
Classification report and confusion matrix
Plotting

Heatmap
Correlation between pH and fixed acidity

Red Wine Quality on Machine Learning, Study Guides, Projects, Research of Machine Learning

Related documents

Partial preview of the text

Download Red Wine Quality on Machine Learning and more Study Guides, Projects, Research Machine Learning in PDF only on Docsity!

A REPORT ON

RED WINE QUALITY

AT

ThinkNEXT Technologies Private Lmited

SUBMITTED IN PARTIAL FULFILLMENT FOR AWARD DEGREE OF

BACHELOR OF ENGINEERING

IN

COMPUTER SCIENCE AND ENGINEERING

BY

DEEPSHIKHA

(SG16312)

UIET, PANJAB UNIVERSITY SSG REGIONAL CENTRE,

HOSHIARPUR-146021, Punjab (INDIA)

DECLARATION

ABSTRACT

CONTENTS

_____________________________________________________________________

2. ANACONDA NAVIGATOR

Why use Navigator?

What applications can be accessed?

TECHNOLOGY USED

1. PYTHON

How ML works?

3. LOGISTIC REGRESSION

Approach