In Progress

Train and evaluate a Machine Learning model on the given dataset.

Consider the time taken to verify a single bug is a minimum two hours where the bugs can be from the field or are found in-house. If a test team verifies hundreds of bugs per year, for 250 defects, that is 100 days of work on nothing else but this. Create a training model that will predict which defects were incorrectly fixed based on previous data (i.e., previously verified defects).

The bug prediction dataset contains data about the following software systems:

- Eclipse JDT Core

- Eclipse PDE UI

- Equinox Framework

- Lucene

- Mylyn

Justify whether to use supervised/unsupervised/reinforcement learning for the task.

1. Import Libraries/Dataset

a. Download the dataset

b. Import the required libraries

2. Data Visualization and Exploration

a. Print at least 5 rows for sanity check to identify all the features present in the dataset and if the target matches with them.

b. Print the description and shape of the dataset.

c. Provide appropriate visualization to get an insight about the dataset.

d. Try exploring the data and see what insights can be drawn from the dataset.

3. Data Pre-processing and cleaning

a. Do the appropriate preprocessing of the data like identifying NULL or Missing Values if any, handling of outliers if present in the dataset, skewed data etc. Apply appropriate feature engineering techniques for them.

b. Apply the feature transformation techniques like Standardization, Normalization, etc. You are free to apply the appropriate transformations depending upon the structure and the complexity of your dataset.

c. Do the correlational analysis on the dataset. Provide a visualization for the same.

4. Data Preparation

a. Do the final feature selection and extract them into Column X and the class label into Column into Y.

b. Split the dataset into training and test sets.

5. Model Building

a. Perform Model Development using at least three models, separately. You are free to apply any Machine Learning Models on the dataset. Deep Learning Models are strictly not allowed.

b. Train the model and print the training accuracy and loss values.

6. Performance Evaluation

a. Print the confusion matrix. Provide appropriate analysis for the same.

b. Do the prediction for the test data and display the results for the inference.

Skills: Python, Machine Learning (ML), Artificial Intelligence, Data Science, Data Mining

See more: machine learning primate factors dataset animal behaviour researcher, using flask to serve a machine learning model as a restful webservice, embedding a machine learning model into a web application, how to build a machine learning model, machine learning model architecture, how to deploy python machine learning model, azure machine learning model management, deploy machine learning model to production, deploy machine learning model python, how to improve machine learning model, validate machine learning model, how to test a machine learning model, machine learning model development process, machine learning model steps, machine learning model accuracy, building a complete machine learning model end to end, deploy machine learning model flask, deploy machine learning model django, how to train a machine learning model, how to apply machine learning model to new dataset

About the Employer:
( 0 reviews ) Tirupati, India

Project ID: #30586937

Awarded to:

pkundu25

I am an experienced IT professional and a Data Science practitioner. Your job caught my eye and looks to be quite interesting to me as I did similar work in recent past .I have developed various algorithms pertaining t More

₹3000 INR in 3 days
(0 Reviews)
0.0

27 freelancers are bidding on average ₹8481 for this job

(106 Reviews)
6.1
(71 Reviews)
6.2
ibrahimanjum330

Hi, I am Ibrahim, and I am a data scientist, I can help you train and evaluate ML models, please share the dataset as well. Regards, Ibrahim Anjum

₹2500 INR in 3 days
(61 Reviews)
5.8
FirmMinds123

We have AI & Data Science,Django team who are highly experienced in Machine learning and Deep Learning and can deliver products as per your requirements. We have done many real time projects like Semantic search engine More

₹9000 INR in 4 days
(29 Reviews)
5.1
suyashdhoot

Hi I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp More

₹30000 INR in 7 days
(46 Reviews)
6.2
ScienceInTheWay

Hi, hope you’re doing well. I am a Python Developer and have rich experience in Machine Learning and Deep Learning, you can check my reviews. I always deliver high quality of work within accepted time limit and budget More

₹7000 INR in 7 days
(60 Reviews)
5.0
(11 Reviews)
4.6
sajjadtaghvaeifr

Hi, I hope you are doing fine. I have almost 10 years of experience in machine learning algorithms. I can implement various types of artificial intelligence algorithms including yours with Matlab, Python and etc. I hav More

₹7000 INR in 7 days
(8 Reviews)
3.8
Sam0072

I am a machine learning engineer having 5+ year of experience. My skills-- Machine learning,deep learning,image processing,Open CV, kaggle project, python, R, data analysis, software development. Deploy ml models to More

₹5000 INR in 4 days
(22 Reviews)
4.1
Gozienkwocha

Hello Sir/Ma'am, I can build a machine learning model to predict incorrectly fixed defects. I'm a data scientist and machine learning expert with 3 years of experience. I can complete the work within the set deadl More

₹3500 INR in 4 days
(8 Reviews)
3.5
ouhsassa

Hello, I am an independent, experienced Machine learning expert. I can help with this task with a quick turn-around. Looking to hearing from you. Kind regards.

₹12500 INR in 5 days
(4 Reviews)
2.6
Gkiprop

Dear employer, I hold master's in statistics and continuing with Ph.D. in applied statistics making me suitable for this job. Am an experienced statistic writer for three years. Besides I am a skilled; R programmers sp More

₹7000 INR in 2 days
(5 Reviews)
2.1
sasidharpurum

hai, I am proficient in ML , computer vision and data science. I gone through your requriments and I done similar projects before. Send me details . I can develop your project as per your requriment and deliver as so More

₹8000 INR in 2 days
(0 Reviews)
0.0
shahsmit01042000

I have been making this Kind of projects for a long time now, I have experience in this field you can check my GitHub profile ([login to view URL]) and kaggle profile ([login to view URL]) where More

₹7500 INR in 5 days
(0 Reviews)
0.0
Aitruework

*****9792979142****** Hi. I am a data scientist. I am very familiar to Deep learning apis such as Tensorflow and fastai, mxnet. I have a good hands on working with Advanced R and Python and BI tools and technologies, A More

₹6000 INR in 2 days
(0 Reviews)
0.0
mangasaini

HELLO, I have read the instructions keenly and understood your specifications for the [login to view URL] this field i have adverse experience since it is my area of specialization. My skills are adequate, and I guarantee total sa More

₹6000 INR in 5 days
(0 Reviews)
0.0
imsubhamsrivatsa

I have done some projects which is similar to the task which is mentioned in this one . That's why i think i can do this task

₹7000 INR in 2 days
(0 Reviews)
0.0
vennalmicheals

Hi , Am a Data Scientist working in a Big MNC Training models is my everyday job, Also a freelancer during free time , I can do this for win win amount which I have quoted, Ping back for more details

₹5000 INR in 3 days
(0 Reviews)
0.0
adhityaresearch

Hello, this is Dr [login to view URL], Chennai, Tamilnadu, India. I have around 19 years experience for both teaching and research. Including around 12 years teaching experience in the Department of Computer Science and Eng More

₹12000 INR in 7 days
(0 Reviews)
0.0
gaurav4746

Hello, I have gone through your problem statement, and i would like to tell you that i can do this. I have done various type of projects related to that. So, if you want i can do this. Thank You!

₹7000 INR in 7 days
(1 Review)
0.0