s.no | NAME | DATA DESCRIPTION |
1 | Portugese Bank Marketing Project | The data is related with direct marketing campaigns (phone calls) of a Portuguese banking institution. The classification goal is to predict if the client will subscribe to a term deposit (variable y).
|
2 | Rice Leaf Disease using CNN | The dataset contains images of rice leaf. There are three classes/diseases: Bacterial leaf blight, Brown spot, and Leaf smut, each having 40 images. The format of all images are jpg.
|
3 | Handwritten Digits Image Processing Dataset | The dataset contains 60,000 small square 28×28 pixel grayscale images of handwritten single digits between 0 and 9.The task is to classify a given image of a handwritten digit into one of 10 classes representing integer values from 0 to 9, inclusively. |
4 | Customer Transaction Prediction | Identify which customers will make a specific transaction in the future, irrespective of the amount of money transacted.
|
5 | Fifa 20 | Using clustering algorithms to categorize similar players using their traits and their different skills in the field.
|
6 | Forest Type Cover Prediction | Predict the forest cover type (the predominant kind of tree cover) from strictly cartographic variables (as opposed to remotely sensed data).
|
7 | Home Loan Default – Risk Management | Contains multiple databases and sources to predict how capable each loan applicant is competent in repaying the loan. |
8 | Liver Patient Prediction | This is a Liver Patient identification Data Set. The target is to identify whether the patient has an anomaly or abnormal functioning of the liver.
|
9 | NBA Shot Selection | This is a Basketball shot selection Data Set. Using 20+ years of data on the dunks and misses, the target is to predict which shots will go through the net and which doesn’t.
|
10 | Cellphone Price Range | This is a Cellphone Price Range Data Set. Sales data of various mobile phone companies have been collected. The bigger scope is to find the relationship between features of a cell phone to its selling price. The target here is to predict a price range of the phone and not the actual price. |
11 | Insurance Claim Prediction | This is an Insurance Claim Prediction Data Set. The bigger scope is to account for inaccuracies in the cost of insurance policies. A cautious driver should get a better price compared to a reckless driver. The target here is to build a Machine Learning model that predicts the probability that a driver will initiate an insurance claim in the following year.
|
12 | Blood Donation Prediction | This is a Blood Donation Prediction Data Set. The dataset contains blood donation details of regular university visits. The target is to predict whether a donor will give blood the next time the blood donation campaign is held again at the university campus. |
13 | Game AI – Winner Prediction | This is a PUBG game Data Set. The dataset contains a large number of anonymized game stats for a single player with all match types. The target is to create a ML model which predicts players’ finishing placement based on their final stats.
|
14 | Walking Running Classification | This dataset contains accelerometer and gyroscope data associated when a person is running and walking. The target is to create a LSTM model to predict the state of body from input of sensor. |
15 | Vaccination Prediction | The dataset contains the respondents information about the pandemic survey where they answered if they had received the H1N1 and seasonal flu vaccines. The goal is to predict how likely individuals are to receive their H1N1 and seasonal flu vaccines. In specific, the target is to predict two probabilities: one for h1n1_vaccine and one for seasonal_vaccine.
|
16 | Earthquake Damage Prediction | The dataset contains the survey results of one of the largest post-disaster data ever collected by the Central Bureau of Statistics. The data contains valuable information on the impact of earthquakes, household conditions, and socio-economic-demographic details. The target is to predict the level of damage to buildings caused by the earthquake.
|
17 | Heart Disease Prediction | The dataset is an excerpt from a study of heart disease which contains various measurements on patient health and cardiovascular statistics. The target is to predict if a patient has any cardiovascular disease or not.
|
18 | Automobile Imports Price Prediction | This is an Automobile Imports Data Set which contains various characteristics of the cars including insurance risk rating and other price indicators. The target is to predict the price of the cars. |
19 | Bike Rental Dataset | This is a Bike Rental dataset which contains the hourly and daily count of rental bikes in a Capital bikeshare system with the corresponding weather and seasonal information as well. The target is to predict the count of total rental bikes including both casual and registered. |
20 | Concrete compressive strength | This is a dataset containing the different factors that can affect the compressive strength of concrete. The target is to use those factors to predict the compressive strength of the concrete. |
21 | House Price Prediction – Advanced Regression | The Housing dataset was compiled by Dean De Cock which is an incredible alternative and an exhaustive dataset with modernized and expanded version of the Boston Housing dataset. The target is to predict the property sales however this requires extensive feature engineering to achieve the same. |
22 | Insurance Cost Prediction | Using the data set prediction what will be the cost of the premium for health insurance. |
23 | Hospital Stay Prediction | Using the features given need to predict if a patients comes to hospital than for how many days he/she can stay |
24 | Covid Prediction | To predict number of death and recovery cases for Covid 19 |
25 | Texas Employee Salary prediction | To predict salary of employees using given features |
26 | Flight fare prediction | Uisng the dataset need to predict fare of flight |