University: University of London (UOL)
Subject: DSM010 Big Data Analysis Course Work
DSM010 Big data analysis Course Work, UOL, Singapore: Find the descriptive statistics for temperature of each day of a given month for the year 2007
Coursework Description
For this coursework, you will solve the given problems using the MapReduce computational model and Mahout on the Hadoop cluster. This coursework counts for 30% of the total marks for the module.
Q1) Find the descriptive statistics for temperature of each day of a given month for the year 2007.
We use weather data from NCDC. You can access the hourly weather data from the ‘Data Sets’ folder under the “Coursework submission” tab on the module VLE. We have chosen the hourly records for April, May, June and July 2007. Each file contains one month of records. You may select any one of the four months (files) for analysis.
The data contain records from different weather stations, identified by wban (the first column). Using the hourly data across all of the weather stations, find:
• The difference between the maximum and the minimum “Wind Speed” across all of the weather stations for each day in the month
• The daily minimum “Relative Humidity” across all of the weather stations
• The daily mean and variance of “Dew Point Temp” across all of the weather stations
• The correlation matrix that describes the monthly correlation among “Relative Humidity”, “Wind Speed” and “Dry Bulb Temp” across all of the weather stations
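As a starting point for the first two tasks, here is a minimal Python sketch of a mapper that emits (day, value) pairs and of reducers that compute the daily range and the daily minimum. It assumes comma-separated records, and the column positions (DATE_COL, WIND_COL) are hypothetical placeholders that must be checked against the header of the chosen file. The function names are illustrative and not part of the coursework. Sorting the pairs stands in for the shuffle phase that Hadoop performs between map and reduce.

```python
from itertools import groupby

# Hypothetical column positions: verify them against the real file header.
DATE_COL = 1   # assumption: date (YYYYMMDD) is the second column
WIND_COL = 12  # assumption: position of the "Wind Speed" column


def map_daily_value(lines, date_col=DATE_COL, value_col=WIND_COL):
    """Mapper: emit (day, value) for every record with a numeric reading."""
    for line in lines:
        fields = line.rstrip("\n").split(",")
        try:
            yield fields[date_col].strip(), float(fields[value_col])
        except (IndexError, ValueError):
            continue  # header row, short row, or missing reading


def reduce_range(pairs):
    """Reducer: max minus min per day; pairs must arrive sorted by day."""
    for day, group in groupby(pairs, key=lambda kv: kv[0]):
        values = [v for _, v in group]
        yield day, max(values) - min(values)


def reduce_min(pairs):
    """Reducer: minimum value per day; pairs must arrive sorted by day."""
    for day, group in groupby(pairs, key=lambda kv: kv[0]):
        yield day, min(v for _, v in group)
```

In a Hadoop Streaming job the mapper would read lines from standard input and print tab-separated key/value pairs, and the reducer would parse them back; the pure functions above keep that I/O separate so the logic can be checked locally first.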
Do NOT use any package that computes the statistics for you. You MUST use the MapReduce framework. Write the pseudocode for the mapper and reducer functions for the above four tasks and implement them in Python. Note that when writing the mapper and reducer, it is helpful to consider the following formulae for variance and correlation:
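One common single-pass form, assumed here and possibly different from the module's own notation, expresses both statistics through running sums: the variance as (Σx²)/n − (Σx/n)², and the correlation as (nΣxy − ΣxΣy) / sqrt((nΣx² − (Σx)²)(nΣy² − (Σy)²)). Because these sums are additive, each mapper or combiner can emit partial sums and the reducer only has to add them up. The sketch below illustrates that idea using the population variance (dividing by n); whether the coursework expects the sample variance instead is not stated. All function names are illustrative.

```python
import math


def partial_sums(values):
    """Summarise a chunk of readings as (count, sum, sum of squares)."""
    n, sx, sxx = 0, 0.0, 0.0
    for x in values:
        n += 1
        sx += x
        sxx += x * x
    return n, sx, sxx


def merge(a, b):
    """Combine two partial summaries; works because every field is a sum."""
    return tuple(u + v for u, v in zip(a, b))


def mean_variance(n, sx, sxx):
    """Mean and population variance from the merged sums."""
    mean = sx / n
    return mean, sxx / n - mean * mean


def pearson(n, sx, sy, sxx, syy, sxy):
    """Pearson correlation from sums over paired readings (x, y)."""
    radicand = (n * sxx - sx * sx) * (n * syy - sy * sy)
    if radicand <= 0:
        return float("nan")  # a constant column has no defined correlation
    return (n * sxy - sx * sy) / math.sqrt(radicand)
```

For the correlation matrix, the same pearson function can be applied to each pair of the three columns, using only records where both readings are present. Note that the sum-of-squares form can lose precision when the values are large relative to their spread.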