# Final Exam Due Saturday 11:59 pm (Week 15) You cannot use any of the datasets in our assignments, class notes, and your own midterm project. If

Final Exam

Due Saturday 11:59 pm (Week 15)

You cannot use any of the datasets in our assignments, class notes, and your own midterm
project. If you are using the same one, you will receive 0 for your final project.

1. Question Formulation (5 points): You need to devise a question that can be
answered through data analysis. This question should be of your own creation,
and it should reflect your curiosity and interest.

2. Data collection (10 points): You are responsible for finding the appropriate
dataset that aligns with your chosen question. Ensure that the data is clean and
organized for analysis. If you don’t know where to find the data set, you can use
Kaggle.com It can give you more inspiration about the question formulation and
data collection. You need to state where you get your data from in order to

3. Exploratory Data Analysis (30 points): Conduct an EDA to understand the
patterns in the data. (Similar to Assignment 2.) Here are some key components of
EDA I am expecting from your paper: (6 points for each following component (if
your EDA does not have any categorical variable), or 5 points each (if your EDA
has the analysis of categorical variables.)

1) summary statistics: compute basic statistics for the dataset, such as mean,
median, standard deviation, minimum, maximum, and quartiles. It provides an
overview of the data’s central tendencies and spread.

2) Data Visualization: Create various plots and charts to visualize the data’s
distribution and relationships. Common visualization tools include histograms, box
plots, scatter plots, bar graphs, and line graphs.

3) Data Distribution: Examine the distribution of individual variables. This helps in
identifying whether the data is normally distributed, skewed, or exhibits other
patterns. Understanding the distribution can influence the choice of statistical
tests and modeling techniques.

4) Correlation Analysis: Determine the relationships between variables using
correlation coefficients or scatter plots. It can reveal potential associations and
dependencies between variables.

5) Categorical Variables (If your data involves this type of variable and you think it
is important to answer your question. If the categorical variables are not that

distribution of categorical variables using frequency tables, bar charts, or pie
charts.

6) Hypothesis Generation: Eventually your exploratory data analysis can lead to the
formulation of hypotheses about relationships or patterns in the data to answer your
question or guide further analysis.

4. Machine Learning (30 points): Build at least 3 different predictive models. (They can
answer the same question, and you will need to compare their performances and pick the best
one. Or they can answer different questions. You have a lot of flexibilities for this step, but all

5. Project Structure (20 points): While this is a mini-project, your report should follow a
structure similar to a combination of Assignment 2 and Assignment 3. This means it
should include sections for introduction, Data collection and Preprocessing, EDA,
Machine Learning, Results and Discussion, and Conclusion.

6. Data Attribution and References (5 points): In the conclusion section of your report,
make sure to include a subsection titled “Data Attribution and References.” In this
subsection, provide a detailed list of the sources where you obtained your data, including
the dataset name, the organization or website from which it was sourced, and any
relevant publication or citation information.
Additionally, if you consulted external research papers, articles, or resources during your
project, please list these references in the same section.

General Requirements
1) You will need to write up your questions, findings, interpretations, and results for

this assignment. It will be a great idea to screenshot your codes, results, and graphs
so that you can explain your findings along with them. (It is also easier for me to
follow you when I read your paper). A pdf file is required. There is no page limit but

2) The py file that you have used to finish your assignment. (It may be a duplicate or
somewhat duplicate of the screenshots that you have inserted in your paper but
that is okay. I would like to look over your codes.)

Order a Similar Paper and get 15% Discount on your First Order

## Related Questions

### instructions are attached Directions: Based on the Rescola-Wagner lectureLinks to an external

instructions are attached  Directions: Based on the Rescola-Wagner lecture Links to an external site. , and using the Rescola-Wagner formula, download this document Download document and complete the tables based on the scenario.

### Write about five food products that are imported from other countries in a three to four page maximum, double-spaced, 12-point font and word-processed

Write about five food products that are imported from other countries in a three to four page maximum, double-spaced, 12-point font and word-processed paper. Number the pages. You must include a brief bibliography in AMA format consisting of at least 10 references (two per item). Read the entire instructions before

### Provide a response to each post below: Each response must be at 150 words.1. Action research is a methodology that places a strong emphasis on

Provide a response to each post below: Each response must be at 150 words. 1. Action research is a methodology that places a strong emphasis on working together to identify issues, create solutions, and carry out changes with participants (What Is Action Research, 2024). This image illustrates the iterative process

### Instructions (contains spoilers): For this assignment, you will be required to read an article and answer a series of questions based on course material.

Instructions (contains spoilers): For this assignment, you will be required to read an article and answer a series of questions based on course material. In the article, Role of the Media in Promoting the Dehumanization of People Who Use Drugs, Habib and colleagues (2023) discuss several ways that the media

### Prepare: Prior to beginning work on this discussion forum: Read: In Laubacher et al.’s (2023) An Introduction to Social Science: Individuals, Society,

Prepare: Prior to beginning work on this discussion forum: Read:  In Laubacher et al.’s (2023) An Introduction to Social Science: Individuals, Society, and Culture Chapter 5: Sociology Chapter 6: Anthropology and Human Geography Chapter 7: Psychology and the Behavioral Sciences

### Prepare: Prior to beginning work on this discussion forum: Read: In Laubacher et al.’s (2023) An Introduction to Social Science: Individuals, Society,

Prepare: Prior to beginning work on this discussion forum: Read:  In Laubacher et al.’s (2023) An Introduction to Social Science: Individuals, Society, and Culture Chapter 5: Sociology Chapter 6: Anthropology and Human Geography Chapter 7: Psychology and the Behavioral Sciences

### Grocery Store Assignment1. Go to a grocery store that caters to foods from another country or the international sectionat a “regular” grocery store. If you

Grocery Store Assignment1. Go to a grocery store that caters to foods from another country or the international sectionat a “regular” grocery store. If you are unable to go to a grocery store, because of socialdistancing or are avoiding grocery stores, view/purchase items online.2. Write about five food products that

### Question P9-29  You must use the associated Excel worksheet to complete all assigned exercises/problems. No other format will be accepted.No Ai,

Question P9-29  You must use the associated Excel worksheet to complete all assigned exercises/problems. No other format will be accepted. No Ai, chegg, etc. Please check your work

### attached file. An asset management company must replace the manager of its two signature mutual funds, who is about to retire. Two candidates have

attached file.  An asset management company must replace the manager of its two signature mutual funds, who is about to retire. Two candidates have been short-listed. The management team is divided and cannot decide which of the two candidates would make the better mutual fund manager. The retiring manager presents

### CES-D Reverse Scoring ExerciseCENTER FOR EPIDEMIOLOGIC STUDIES—DEPRESSION SCALECircle the number of

CES-D Reverse Scoring Exercise CENTER FOR EPIDEMIOLOGIC STUDIES—DEPRESSION SCALE Circle the number of each statement, which best describes how often you felt or behaved this way – DURING THE PAST WEEK. Rarely or none of the time (less than 1 day) Some or a little of the time (1-2

### Newspaper Article Project Guidelines: Use the Seven-Step Decision Model, (textbook p. 19-20) for objectively analyzing Ethical/Bioethical Issues. You must

Newspaper Article Project Guidelines: Use the Seven-Step Decision Model, (textbook p. 19-20) for objectively analyzing Ethical/Bioethical Issues. You must locate 2 articles that have an ethical/bioethical dilemma. A dilemma is when someone has a decision to make as to do something or not. Example, Should a person be taken off

### William Fredericks 42 y/o 6′ 0″ (183 cm) 215.0 lb (97.7 kg) Reason for encounter Worsening headache and facial pain, fever get more information on

William Fredericks 42 y/o 6′ 0″ (183 cm) 215.0 lb (97.7 kg) Reason for encounter Worsening headache and facial pain, fever get more information on   Ndigen writers here

### Statistics IMG_0255.jpgIMG_0257.jpgIMG_0259.jpgIMG_0253.jpgIMG_0258.jpgIMG_0254.jpgIMG_0256.jpg

Statistics  IMG_0255.jpg IMG_0257.jpg IMG_0259.jpg IMG_0253.jpg IMG_0258.jpg IMG_0254.jpg IMG_0256.jpg

### ANNOTATED BIBLIOGRAPHY ASSIGNMENT INSTRUCTIONSOVERVIEWFor this assignment, you will create an annotated list of at least 10 resources (both

ANNOTATED BIBLIOGRAPHY ASSIGNMENT INSTRUCTIONS OVERVIEW For this assignment, you will create an annotated list of at least 10 resources (both Christian and secular) that you plan to use for your Research Paper Assignment later in the course. The resources must have been published within the last five years. These sources

### please see attachmentThis assignment has two parts: 1. Identify and discuss the quality management objectives in a healthcare

please see attachment This assignment has two parts: 1. Identify and discuss the quality management objectives in a healthcare organization, including at least four (4) definitions of introductory quality management terms. 2. Identify and discuss how to apply systems thinking strategies to healthcare quality control and management processes. This is

### please see attachmentplease review the following list of potential problem/situations and choose one for your final assignment, Strategic HR Plan.

please see attachment please review the following list of potential problem/situations and choose one for your final assignment, Strategic HR Plan. You can write your paper as if you were employed at a Fortune 500 company, a current organization that is in the news, or a past or present employer.

### please see atachmenta) demonstrates proficiency in setting up an ethnographic research project; b) describes a strategic site for ethnographic

please see atachment a) demonstrates proficiency in setting up an ethnographic research project; b) describes a strategic site for ethnographic research; and c) conducts ethnographic research (via observation only) and presents some preliminary findings. Your paper should utilize sound critical thought, refer to course materials, and be written in APA-format