If you want to be an AI expert, then data science is the very first step for you. In this blog we will discuss 8 beginner’s projects which can help you to sharpen your data science knowledge.
Iris Data Set
This is one of the most versatile projects which will help you to learn how to do classification. For absolute beginners, this is the perfect project. This dataset has only 150 rows and 4 columns.
What is the project?
To predict the class of flowers based on available attributes.
Click here to get the Data: https://archive.ics.uci.edu/ml/datasets/Iris
Loan Prediction Dataset
If we analyze all of the industries, we will find that the insurance domain is the one that has the highest use of analytics and data science methods.
Loan prediction is the best way to learn how to develop strategies when you have a lot of variables that can influence the data. This is basically a higher-level classification problem. This data has 615 rows and 13 columns.
What is the project?
To predict if a loan will get approved or not?
Click here to get the Data: https://www.kaggle.com/altruistdelhite04/loan-prediction-problem-dataset
BigMart Sales Dataset
Apart from the insurance industry, the retail industry is also the one that uses data analytics most. Just imagine, how the companies like big-bazaar and Grofers know what do you like, and they showcase the same products whenever you open the platform. That’s because they study consumer behavior and using data science they predict the outcome.
In this particular data set you will be able to understand how to do product placement, inventory management, customized offers, product bundling, etc. are being smartly handled using data science techniques.
This is basically a regression problem. This dataset has 8523 rows and 12 variables.
What is the project?
To predict the sales of the store.
Click here to get the Data: https://www.kaggle.com/brijbhushannanda1979/bigmart-sales-data
Boston Housing Dataset
Boston Data Set is a very small data set but in the process of learning and understanding the data science, it’s very important.
This data has 506 rows and 14 columns.
What is the project?
To predict the median value of owners occupied homes.
Click here to get the Data: https://www.kaggle.com/c/boston-housing
Time Series Analysis Dataset
You need to understand the time series analysis to do the tasks like weather forecasting, predicting sales, analyzing year-on-year trends.
What is the project?
To predict the traffic on a new mode of transport.
Click here to get the Data: https://www.kaggle.com/lampubhutia/timeseries-modelling-predicting-traffic-growth