



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Deep Learning Project and datasets
Typology: Cheat Sheet
1 / 7
This page cannot be seen from the preview
Don't miss anything!




โ Bing Coronavirus โ Classify Bing Queries as either specific (e.g. about a specific location) or generic. You might have to figure out a more exact definition of specific or generic though โ Dataset: BingCoronavirusQuerySet โ Covid Clinical Data โ Rank and sort high risk patients using clinical data. Pick an interpretable approach if you can. โ Dataset: CovidClinicalData If you haven't already, checkout Kaggle's Covid19 Section as well. It has datasets and ideas both.
โ Autonomous Tagging of StackOverflow Questions โ Make a multi-label classification system that automatically assigns tags for questions posted on a forum such as StackOverflow or Quora. โ Dataset: StackLite or 10% sample โ Keyword/Concept identification โ Identify keywords from millions of questions โ Dataset: StackOverflow question samples by Facebook โ Topic identification โ Multi-label classification of printed media articles to topics โ Dataset: Greek Media monitoring multi-label classification
โ Dataset: 45 years of rainfall data โ Multi-variate Time Series Forecasting โ How polluted will your town's air be? Pollution Level Forecasting โ Dataset: Air Quality dataset โ Demand/load forecasting โ Find a short term forecast on electricity consumption of a single home โ Dataset: Electricity consumption of a household โ Predict Blood Donation โ We're interested in predicting if a blood donor will donate within a given time window. โ More on the problem statement at Driven Data. โ Dataset: UCI ML Datasets Repo
โ Movie Recommender โ Can you predict the rating a user will give on a movie? โ Do this using the movies that user has rated in the past, as well as the ratings similar users have given similar movies. โ Dataset: Netflix Prize and MovieLens Datasets โ Search + Recommendation System โ Predict which Xbox game a visitor will be most interested in based on their search query โ Dataset: BestBuy โ Can you predict Influencers in the Social Network? โ How can you predict social influencers? โ Dataset: PeerIndex
โ Image classification โ Object recognition or image classification task is how Deep Learning shot up to it's present-day resurgence โ Datasets: โ CIFAR- โ ImageNet โ MS COCO is the modern replacement to the ImageNet challenge
โ MNIST Handwritten Digit Classification Challenge is the classic entry point โ Character recognition (digits) is the good old Optical Character Recognition problem โ Bird Species Identification from an Image using the Caltech-UCSD Birds dataset dataset โ Diagnosing and Segmenting Brain Tumors and Phenotypes using MRI Scans โ Dataset: MICCAI Machine Learning Challenge aka MLC 2014 โ Identify endangered right whales in aerial photographs โ Dataset: MOAA Right Whale โ Can computer vision spot distracted drivers? โ Dataset: State Farm Distracted Driver Detection on Kaggle โ Bone X-Ray competition โ Can you identify if a hand is broken from a X-ray radiographs automatically with better than human performance? โ Stanford's Bone XRay Deep Learning Competition with MURA Dataset โ Image Captioning โ Can you caption/explain the photo a way human would? โ Dataset: MS COCO โ Image Segmentation/Object Detection โ Can you extract an object of interest from an image? โ Dataset: MS COCO, Carvana Image Masking Challenge on Kaggle โ Large-Scale Video Understanding โ Can you produce the best video tag predictions? โ Dataset: YouTube 8M โ Video Summarization โ Can you select the semantically relevant/important parts from the video? โ Example: Fast-Forward Video Based on Semantic Extraction โ Dataset: Unaware of any standard dataset or agreed upon metrics? I think YouTube 8M might be good starting point. โ Style Transfer โ Can you recompose images in the style of other images? โ Dataset: fzliu on GitHub shared target and source images with results
Data Science ML Full Stack Roadmap https://github.com/hemansnation/Data-Science-ML-Full-Stack- Join the Data Science & ML Full Stack WhatsApp Group Community here: If the group is full, please join another one. https://chat.whatsapp.com/B7Mdp6QTMJ0KZYGWrziT3Y https://chat.whatsapp.com/HWDSJU4KXrXJIcn5Npp3Gm https://chat.whatsapp.com/DmATV5uaVY7IKrTMHDiHnr https://chat.whatsapp.com/Blz2n8QYSgdKWfQbJZxHtJ Join Telegram for Data Science ML AI Resources: https://t.me/+sREuRiFssMo4YWJl Join Community on LinkedIn: https://www.linkedin.com/groups/12540639/ Connect with me on these platforms: LinkedIn: https://www.linkedin.com/in/hemansnation/ Twitter: https://twitter.com/hemansnation GitHub: https://github.com/hemansnation Instagram: https://www.instagram.com/masterdexter.ai/ Are you a professional? DM for One-on-One sessions for Python, Data Science, Machine Learning, and Data Engineering. Here: https://bit.ly/3U6zQvQ