Small Data Analysis project (Basic)

PROJECT DESCRIPTION:
This project is comprised of a number of small tasks. To accomplish the tasks, you would need to review the provided 9 datasets. (But only need to use 4 data sets for the project) and develop research questions that you find plausible and interesting. Then, you will use Excel to split, or merge, or manipulate the relevant data, perform the analysis and present the results. Note that not all information (rows, columns) needs to be used, selecting the interesting information to be used is part of your task and will depend on your research questions.

PROJECT TASK:
Tasks: you would need to use ALL quantitative methods introduced in this course to solve your own research questions.

The methods that you need to use include:
1) Descriptive methods, Pie Charts, Bar Charts, Line Charts, and Histograms (Lecture 2);
2) Normal Distribution (Lecture 3 and 4);
3) Simple Random Sampling (Lecture 5);
4) Sample Distribution (Lecture 5);
5) Confidence Interval (Lecture 5);
6) Hypothesis Testing of one population (Lecture 6);
7) Hypothesis Testing of two population (Lecture 7);
8) Association Testing, linear regression (Lecture 8 and 9);

You would need to use at least 4 different datasets to develop your research questions with the quantitative methods. Your research questions are not necessarily connected. They can be separate questions on different datasets. (You are also free to use other datasets of interest. But the datasets must contain more than 10,000 rows and 5 columns. A small dataset is not accepted.)
You would need to write a short report that summarizes your research questions and the datasets you have used. The report should contain: introduction, research questions with brief motivations, corresponding datasets, brief methodology, discussion of the results, conclusions.
1. The report should contain 1200  words (excluding graphs, images and tables).
2. Separate files, Excel files with and without formula should be submitted too.

RESEARCH QUESTION EXAMPLE
The following are some research question examples using the provided datasets. Those research questions are only for inspiration. You can simply use the following questions. You are also strongly encouraged to develop more questions based on your own interest and investigation.

1-Data_amazon_consumer_review
Will the ratings of electronics product and home and office accessories in Amazon the same? (Two population hypothesis testing)
Will the ratings after 2017 better than the average rating before 2017? (One population hypothesis testing)

2-Data_FIFA_2017
Illustrate the average wage of the football stars across different countries. (Using Pie chart, Bar char……)
Illustrate the histogram of Spanish/Brazilian/ football stars wage/ball control/dribbling.
If we randomly select 5% football starts from the datasets as a small sample, can we find out the confidence interval of their wages? (confidence interval)
Is Spanish football stars wage higher than England football stars wage? (Two population hypothesis testing)
Is England footballer faster than the Spanish footballer? (Two population hypothesis testing)
Is Brazilian footballer dribbling better than the average dribbling score of England footballer? (One population hypothesis testing)
Can the factors as Acceleration, Aggression, Agility, Balance, Ball control, Dribbling etc. explain the footballers wage? Which factor is more significant? (linear regression)

3-Data_Football_events
Does shooting from shot_place #3 have higher probability of a goal than shooting from all other places? (histogram, normal distribution)
Where does Lionel Messi like to shoot the most? (histogram, normal distribution) Where does Cristiano Ronaldo most likely to goal? (histogram, normal distribution)
Do the factors such as shot_place, shot_outcome, location, assist_method, etc. significantly contribute to a goal? (logistic regression)

4-Data_hotel_Reviews
Do the hotels in UK have the higher review score than the hotels in US? (Two population hypothesis testing)
Can the negative review word counts and positive review word counts explain the review score? (linear regression)

5-Data_LA_restaurant-health-violations
Are the scores of restaurants having violation code F001 lower than the scores of restaurants having violation code F030? (Two population hypothesis testing)
Among the restaurants with scores higher than 90, which rule do they most likely to violate? (histogram, normal distribution)

6-Data_RedWine
Is the Spain wine more expensive than US wine? (Two population hypothesis testing)

7-Data_sales
Are the sales in holiday higher than non-holiday? (Two population hypothesis testing)
Is there any relation between the temperature and the sales? Is there any relation between the fuel price and sales? Do the Unemployment %, IsHoliday, CPI, Temperature (F), Fuel_Price, etc. explain the sales? (Association Testing, linear regression)

8-Data_Sweden_Airbnb
Do the factors as host_response_rate, is_location_exact, minimum_nights, maximum_nights, and etc. explain the price of the Airbnb? (Association Testing, linear regression)

9-Data_Youtube_GBvideos
Do people like videos in category 10 more than category 2? (Two population hypothesis testing)
Is there any relationship between the views and likes? If there is, is it positive or negative? (Association Testing, linear regression)

Childcare Director/Owner Student Project Operations Improvement Plan

PLEASE READ IN DETAIL:

Student Project Operations Improvement Plan
Prepare a typewritten project paper describing your plan to improve the process (TASK) you currently work in (Childcare Director), presented with Problem Statement and Flowcharts. This plan should identify and explain in details of the process, including graphical representations and flowcharts.

This paper must be a coherent unification of operation management, and quality processes quantifying the magnitude of the anticipated improvement, e.g., quantifiable cycle time reductions, inventory cost reductions.reduction of labor cost or a significant differentiation of product/services.

Data depicting the results of the process improvement is required. You should be able to produce data, and present a data analysis that will be the basis for your recommendations. If data is not available, (you can build data on an assumption basis) outline how such data could be obtained, what the data might be expected to reveal, and how you would evaluate the success of the improvement project. In either case, be sure to provide a detailed explanation of how you arrived at your conclusions and recommendations. Minimum 20-30 pages.

Follow this Outline:
Mission
SWOT-Strengths, Weaknesses, Opportunities and Threats

Strategy

Company Profile
Process Narrative
Process(Current) Flow Charted
Problem Identification
Benchmark- compare with similar processes- the industry does not have to be simular
New Proposed Process Narrative
Process (new) Flow Charted

Incorporate at least 2 quality tools (reference chapter 13 of text)
Cost Benefit = cost – benefit- DETAILED
Recommendation

Implementation
Conclusion/Summary-( Strong Close)

Textbook is Managing for Quality and Performance Excellence
ISBN-13: 9781285069463

Web Crawler

Need to design a web crawler using C language, the specifications for this assignment is in that project 1 web crawler pdf. There’s two additional documents for starting tips.
My tutor said the sort of http libraries that you shouldn’t use are  libcurl.  (as an exception, you can use the  HTML  parser  htmltidy.c from libcurl.)
Also they said client.c could be a good starting point, most of them actually use that file to modify to make it work as a crawler.

Business plan for Platform Business

In the event of a corona virus, think about which business is the most profitable and write a business plan. Suppose the corona event lasts 12 months. As a company CEO, write how you would run the company. From the start, you must anticipate everything from capital, items, sales, profits after 12 months, etc.

NO PLAGIARISM!!!!!!