Calculate the descriptive statistics fromthe data and display in a table. Be sure to comment on the central tendency,variabilityand shape for housing price and two additional

Descriptive statistics, inferential statistics & multiple linear regression
Paper, Order, or Assignment Requirements

Instructions:

This is a group assignment with a minimum group size of two and a maximum group size of three. All group members will receive the same marks for the assignment. All group members must be enrolled in the same tutorial. The assignment must be provided in the form of a (brief) business reportapproximately 5–9 pages (including this cover page). You must submit an electroniccopy of your assignment in Blackboard. Hard copies will not be accepted.SHOW YOUR WORK for Calculation based questions.

This assignment requires the use of Microsoft Excel. If you have Windows, you will also need to use the Data Analysis ToolPak. If you have a Mac, you will need to use StatPlus:MAC LE.

Group Members:

First name Last name StudentID

Please indicate your tutor and tutorial time:

Tutor
Tutorial date and time

Problem Description:

You are consultants working with an online real estate appraiser, onthehouse.com.au. In order to better calibrate their models to predict housing prices, your supervisor has asked your group to develop a model to appraise the price of homes in a capital city of Australia based on characteristics of the home and the surrounding neighbourhood. In economics, this is commonly called a “Hedonic Regression.”[1]

You will use descriptive statistics, inferential statisticsand your knowledge of multiple linear regression to complete this task.

Housing data for 100 single-family units lists housing price data (in $000s) (Dependent Variable)and several characteristics of the home and neighbourhood(Independent Variable) for a capital city in Australia are given in the Excel file: Monday.xlsx.

Here is a table describing the variables in the data set:

Variable Definition
Price Price of sold single-family home is $000s
Bed Number of bedrooms in the house
Dis Distance to nearest CBD in kilometres
Floor Area of home is square metres
School State ranking of nearby public secondary school. Varies from 0 to 100 points.
Train Dummy Variable indicating whether a train station is located within 500 metres

Required:

Calculate the descriptive statistics fromthe data and display in a table. Be sure to comment on the central tendency,variabilityand shape for housing price and two additional (1 Mark)
Draw a graph that displays the relative share of bedrooms in the sample. (1 Mark)
Create a box-and-whisker plot for the distribution of the price of the homes and describe the shape. Is there evidence of outliers in the data? (1 Mark)
What is the likelihood that a house is both over $600,000 and more than 10 kilometres from the CBD?Is the price statistically independent of distance? Use a Contingency Table. (2 Marks)
Estimate the 90% confidence interval for the population mean housing price. (1 Mark)
Your supervisor recently stated that it is obvious that the mean housing price is greater than$610,000,which was the average price of housing sold last year. Test his claim at the 5% level of significance. (1 Mark)
Run a multiple linear regression using the data and show the output from Excel. (1 Mark)
Is the coefficient estimate for the number of bedrooms statistically different than zero at the 5% level of significance? Set-up the correct hypothesis test using the results found in the table in Part (G) using both the critical value and p-value approach. Interpret the coefficient estimate of the slope. (2 Marks)
Interpret the remaining slope coefficient estimates.Comment on whether the signs are what you are expecting. (2 Marks)
Interpret the value of the Adjusted R2. Is the overall model statistically significant at the 1% level of significance? Use the p-value approach. (1 Mark)
Do the results suggest that the data satisfy the assumptions of a linear regression: Linearity, Normality of the Errors, and Homoscedasticity of Errors? Show using scatter diagrams, normal probability plots and/or histograms and Explain. (3 Marks)
Based on the results of the regressions, is it likely that other factors have influenced housing prices? If so, provide a couple possible examples and indicate whether these would likely influence the regression results if they were included. (1 Mark)
If a community housing organisation asked for information regarding the characteristics of housing targeting the households of Aboriginal and Torres Strait islanders, explain whether a simple random sampling technique would provide an accurate representation of these households. (Note: This question does not use the data)(1 Mark)
[1]http://www.investopedia.com/terms/h/hedonicpricing.asp

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

What are the major characteristics of the men in this data set (such as their age, education level, employment status, and earnings if employed)?

Introductory Statistics & Data Analysis assignment
Order Instructions/Description

Suppose that you work as an analyst for a government agency, which funds various job
training programs. You are assigned to analyse the above data set. In particular, your boss
Sam wishes to know the effectiveness of the training. Sam notices that the proportion of the
employed among the 445 men after training is 69.2%, which is much greater than the
proportion of the employed before training (35.1%). Sam is unsure whether this increase in
employment rate is related to the training, or is simply a consequence of general
improvements in the job market. Sam posts the following questions.
(a) What are the major characteristics of the men in this data set (such as their age,
education level, employment status, and earnings if employed)?
(b) After the training program, is the average earnings of the employed with training
higher than the average earnings of the employed without training?
(c) After the training program, is the employment rate of the men with training higher
than the employment rate of the men without training?
(d) Do you have other ways to demonstrate the effect of training on employment?
“You should be able to find answers to these questions with some of your statistical skills that
you always boast about.” Sam demands cheerfully.

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

Produce the relevant graph and table to summarise the ‘CRIME’ variable and write a paragraph explaining the key features of the data observed in the output in the style presented in the course materials.

Statistics Assignment
Paper, Order, or Assignment Requirements

STA10003 FOUNDATIONS OF STATISTICS ASSIGNMENT

This assignment is worth 20% of your final mark.

Scenario

You are a new graduate at a social science and psychological sciences research institute, and the lead researcher has given you a dataset to analyse. You will be graded based on the rubric attached to these instructions.

The data set is based on Australian victims of crime statistics from the Australian Bureau of Statistics [ABS] collected over the period from 2010 to 2014. The original data file contained over 165,000 observations extracted from various ABS catalogues[eg Cat. 4510 /Cat. 4530.0], and a representative subset of the original data containing 8292 observations [STA10003_Assignment_SP3_2015.sav] is located in the Assignment Information tab on Blackboard.

Data Preparation

Before attempting the Assignment questions, you must use SPSS to draw a random sample of 2000 [from the 8292 cases]. You will conduct your analysis on your sample of 2000 observations. Instructions on how to generate your random sample is attached. Note, however, that some variables contain missing values, so each of your analyses may not contain the entire 2000 cases.

Submission Instructions

Your submission must be a single Word file or PDF file and should contain the relevant output.
You must submit your file via the Turnitin link on Blackboard by the specified due date and time. Only the last document you submit will be retained by Turnitin.
This is an individual assignment. Do not share your work with other students. They will have a different random sample of data, so any copying will be detected by Turnitin.

Video demonstrations

Videos showing how to prepare your data and how to submit via Turnitin are provided for you in the Assignment Information tab [ > Assignment Information folder] on Blackboard.

For your assignment, you are required to complete the following five [5] questions by producing the appropriate analyses and writing the relevant report for each question.

Note: For each question you should include the relevant output with your report.

Question 1

The variable ‘CRIME’ indicates the type of crime reported by Australian victims of crime during the period 2010 – 2014. Produce the relevant graph and table to summarise the ‘CRIME’ variable and write a paragraph explaining the key features of the data observed in the output in the style presented in the course materials.

Question 2

The variable ‘INCIDENTS’ measures the number of times that the respondent – or a member of their household – has been a victim of crime at the time this incident was reported. Produce the relevant graph and tables to summarise the ‘INCIDENTS’ variable and write a paragraph explaining the key features of the data observed in the output in the style presented in the course materials.

Question 3

The variable ‘WEAPONTYPE’ indicates the type of weapon used during the crime.

Previous research has shown that 79% of all crimes are committed without any type of weapon being used. It has been suggested that the percentage of crimes committed without a weapon being used is now lower than this.

Conduct a Binomial test using the ‘WEAPONTYPE’ variable to test this claim. Produce the relevant output and write a Binomial test report based on your output in the style presented in the course materials.

Question 4

The variable ‘AGE’ indicates the age of the respondent at the time the crime was committed.

Previous research has indicated that the average age of victims of crime is 24 years. It is expected that the average age of victims of crime is currently higher than this.

Conduct a One-sample t-test using the ‘AGE’ variable to test this claim. Produce the relevant output and write a One-sample t-test report based on your output in the style presented in the course materials.

Question 5

Researchers predict that the average number of incidents of crime is higher for females than for males. Conduct an Independent samples t-test using the ‘INCIDENTS’ and ‘SEX’ variables to test this claim. Produce the relevant output and write an Independent samples t-test report based on your output in the style presented in the course materials.

How to generate your random sample of 2000 observations:

Open the STA10003_Assignment_SP3_2015.sav data file. From the Transform drop-down menu, select Random Number Generators:

From the Random Number Generators dialogue box, click the Set Active Generator and Set Starting Point as shown below. Click OK:

From the Data drop-down menu, select Select Cases:

From the Select Cases dialogue box, choose Random Sample of Cases and then click the Sample button [the Sample button is in grey-scale until you select the Random sample of cases choice]:

From the Select Cases: Random Sample dialogue box, click Exactly and type 2000 cases from the first 8292.
[again the information is in grey-scale until you select ‘Exactly’. We want to generate a random sample of 2000 from the entire data set, so enter 2000 cases from the first 8292 cases]:

After clicking Continue [this returns you to the Select Cases Dialogue Box] you will see next to the Sample button confirmation that 2000 cases have been selected:

We can remove the unselected cases by clicking the Delete unselected cases button under the Output heading:

After clicking OK, your data set will now only show the 2000 cases selected.

You should now save the data file with a new name. The data file is ready to use for your Assignment!

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

Is burnout and retention more prevalent among providers who work with mentally challenged patients? What are some resources that could be employed to reduce burnout and increase employee retention?

Statistics and data analysis
Paper, Order, or Assignment Requirements

analyze the below questions by comparing burnout/and or retention between two groups, providers who work with mentally-challenged patients, and providers who work with mainstream patients. You can compare burnout scores from a survey using an independent samples t-test. Or, you can compare retention-related percentage with a chi square.

Research Question:

Is burnout and retention more prevalent among providers who work with mentally challenged patients? What are some resources that could be employed to reduce burnout and increase employee retention?

Assignment 2 will contain your Introduction section from Assignment 1, will add content to the Literature Review, will contain the Methodssection describing how you will collect and analyze your data, and will contain an initial draft of your survey.

Literature Review (Phase 2):

Select two additional references that support the methods you’ll use to collect (survey) and analyze your data (regression, t-test, ANOVA, etc.). Write two-to-three paragraphs describing the content of each reference. In your paragraphs, provide a brief background on how each reference relates to the data-collection portion of your research project.

Methods (address the following):

Population and sample

Instrumentation (survey)

Data Collection

Data Analysis

Survey

Provide an initial draft of your survey along with the descriptions/directions needed for others to understand how to respond.

Rubric

Assignment 2: Literature Review (Phase 2), Methods, Survey

Assignment 2: Literature Review (Phase 2), Methods, Survey

Criteria Ratings Pts

CONTENT: Include enough information to initially address: • Literature Review (Phase 2) o Two references • Methods o Population and sample o Instrumentation (survey) o Data Collection o Data Analysis • Survey Draft

40 pts

ORGANIZATION/DEVELOPMENT: • The Literature Review contains a two-to-three paragraph description of each of two references. The paragraphs adequately relate the references to the data collection and analysis methods for the proposed research project. • The Methods section provides sufficient information to describe the data collection and analysis for the proposed research project. • The survey draft lays out questions and gives enough instruction and description for participation.

40 pts

MECHANICS: • The paper is consistent with APA formatting guidelines and meets course-level requirements. • Intellectual property is recognized with in-text citations and appropriate reference(s). • The paper is laid out with effective use of headings, font styles, and white space. • Rules of grammar, usage, and punctuation are followed; spelling is correct.

20 pts

Total Points: 100

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

What is the conditional relative frequency of those aged over 25, given that they are female?

Statistics Multiple Choice
Paper, Order, or Assignment Requirements

Multiple choice:

To halve the margin of error of an opinion poll in general it is necessary to [1 mark]
Halve the sample size
Double the sample size
Triple the sample size
Quadruple the sample size

Which of the following divides a sample into two equal parts? [1 mark]
Mean
Median
Standard deviation
2 x standard deviation

A list of 5 pulse rates is: 70, 64, 80, 74, 92. What is the median for this list? [1 mark]
74
76
77
80

The following questions (4 and 5) refer to the contingency table below which classifies students by age and sex

AGE (years) male female

≤ 25 120 30

> 25 20 40

What is the marginal relative frequency of males? [1 mark]
3/5
4/7
2/3
None of the above

What is the conditional relative frequency of those aged over 25, given that they are female? [1 mark]
2/7
3/7
4/7
None of the above

If the correlation between the number of cigarettes smoked in a lifetime and the incidence of cancer is 0.35, then [1 mark]
Smoking causes cancer
Cancer causes smoking
There is a third factor that caused both smoking and cancer
No conclusion can be drawn
What is the correlation coefficient for the three pairs of numbers x and y?
x -2 0 4

y 1 0 -2

-1
0
1
None of the above

The p-value obtained from a two tailed test of a null hypothesis that a parameter value is zero against an alternative hypothesis that the parameter is larger than zero is the probability that [1 mark]
The null hypothesis is true
The observed value of the test statistic will occur if the null hypothesis is true
The observed value of the test statistic is statistically significant
Any value as large as or larger than the test statistic will occur if the null hypothesis is true

Which of the following statistical test is applicable for one sample from a skewed distribution? [1 mark]
Z-test
Student’s t-test
Chi-square test
Correlation test

A contingency table has 6 rows and 3 columns. So the degrees of freedom for a test of the hypothesis that the two variables are independent is [1 mark]
2
18
17
10

A distinction between a population parameter and a sample statistic is:
[1 mark]

A population parameter is only based on conceptual measurements, but a sample statistic is based on actual measurements.
A sample statistic changes each time you try to measure it, but a population parameter remains fixed.
A population parameter changes each time you try to measure it, but a sample statistic remains fixed across samples.
The true value of a sample statistic can never be known but the true value of a population parameter can be known.

A survey asked people how often they exceed speed limits. The data were then categorized into the following contingency table of counts showing the relationship between age group and response. [1 mark]
Exceed speed limit

AGE (years) sometimes never

≤ 30 100 100

> 30 40 260

What is the relative risk of exceeding the speed limit for people under 30 compared to people over 30?

3.75
0.4
0.27
40%

A chi-square test involves a set of counts called “expected counts.” What are the expected counts? [1 mark]
Hypothetical counts that would occur if the alternative hypothesis were true.
Hypothetical counts that would occur if the null hypothesis were true.
Hypothetical counts assuming all categories are equally likely.
The counts over a long time that would be expected if the observed counts are representative.

A pharmaceutical company claims that its antihypertensive drug allows people to reduce their blood pressure by 8mm of mercury after one month of treatment. If we want to conduct an experiment to determine if the patients‟ blood pressure is not being reduced to the level advertised, which of the following hypotheses should be used?[1 mark]
H0: μ = 8; Ha: μ > 8
H0: μ = 8; Ha: μ ≠ 8
H0: μ = 8; Ha: μ < 8
H0: μ ≠ 8; Ha: μ < 8

simple random sample of size n = 25 is drawn from a population with mean 50 and standard deviation 5. What is the standard deviation of the sample mean x ?
[1 mark]

1
2
5
10

SHORT ANSWER QUESTIONS

You’ve just been appointed to work as a Health Promotion Officer in Longreach, central Queensland. The management of diabetes is a critical local health issue. An advanced medical clinic was recently set up in the centre of town, but those most in need, particular non-town residents, do not appear to be using it. You think access might be a critical barrier. You’ve read a recent evaluation report about a mobile service operating in a similar area that involves a multidisciplinary team travelling to patients in remote locations.
Your supervisor thinks this might be a good option, but she wants evidence that the community would use such a service. She asks you to conduct a survey, but you suggest that before you do that you should examine community perceptions in more detail. You suggest a qualitative study would be a good place to start.

Briefly identify one reason to justify your proposal to conduct a qualitative study? [1 mark]
Identify two methods you could use to collect data on this issue. [2 marks]
What sample strategy will you use to identify people for your study? Provide an explanation of the sample strategy you have chosen and briefly justify why you have chosen it. [2 marks]

To determine whether the mean nicotine content of a brand of cigarettes is greater than the advertised value of 1.4 milligrams, a health advocacy group tests the null hypothesis H0 : µ = 1.4 against the alternative hypothesis Ha : µ > 1.4
The sample of 100 cigarettes had a mean of 1.2 milligrams and the population standard deviation was known to be 0.8. A z-test statistic was calculated.

Explain why a z-test statistic was used instead of a t-test statistic. [1 mark]
Calculate the standard error of the sample mean. [1 mark]
Calculate the z-test statistic. [1 mark]
The p-value derived was 0.006. Is this significant at the 1% level? [1 mark]
Explain the result of the test in words to someone who knows no statistics
[1 mark]

An opinion poll company asked a random sample of 1009 adults which causes of death they thought would become more common in the future. Topping the list was car accidents: 70% of the sample thought deaths from car accidents would increase.
How many of the 1009 people interviewed thought deaths from car accidents would increase? [1 mark]
The margin of error for this poll is reported to be plus or minus 3 percentage points. Explain to someone who knows no statistics what “margin of error plus or minus 3 percentage points” means. [1 mark]
Give a 95% confidence interval for the proportion of people who think deaths from car accidents will become more common. [1 mark]
Explain what this confidence interval means. [1 mark]
Compare your confidence interval with the margin of error of 3 percentage points reported by the company – why or why not do they differ? [1 mark]

The Abstract below is from the following paper:
Law, C.-k., Sveticic, J. and De Leo, D. (2014), Restricting access to a suicide hotspot does not shift the problem to another location. An experiment of two river bridges in Brisbane, Australia. Australian nd New Zealand Journal of Public Health, 38: 134–138.

Background: Restricting access to lethal means is a well-established strategy for suicide prevention. However, the hypothesis of subsequent method substitution remains difficult to verify. In the case of jumping from high places („hotspots‟), most studies have been unable to control for a potential shift in suicide locations. This investigation aims to evaluate the short- and long-term effect of safety barriers on Brisbane’s Gateway Bridge and to examine whether there was substitution of suicide location.

Methods: Data on suicide by jumping – between 1990 and 2012, in Brisbane, Australia – were obtained from the Queensland Suicide Register. The effects of barrier installation at the Gateway Bridge were assessed through a natural experiment setting. Descriptive and Poisson regression analyses were used.

Results: Of the 277 suicides by jumping in Brisbane that were identified, almost half (n=126) occurred from the Gateway or Story Bridges. After the installation of barriers on the Gateway Bridge, in 1993, the number of suicides from this site dropped 53.0% in the period 1994–1997 (p=0.041) and a further reduction was found in subsequent years. Analyses confirmed that there was no evidence of displacement to a neighbouring suicide hotspot (Story Bridge) or other locations.

Conclusions: The safety barriers were effective in preventing suicide

A summary of Table 1 is as follows.

Suicides by jumping from a high place in Brisbane: 1990-2012

Gateway Story Other bridges Other jumping sites Total

1990-1993 22 15 6 13 56

1994-1997 11 17 2 16 46

1998-2012 5 56 12 102 175

Total 38 88 20 131 277

Answer the following questions, briefly justifying your answer.

What is the study design? [2 marks]
For the comparison of suicide numbers from the Gateway bridge between the periods 1990-93 and 1994-97.
State the null hypothesis and alternative hypothesis being tested.
[1 mark]

ii Explain how the hypotheses would be tested (the actual calculations are not needed). [2 marks]

How would you test the hypothesis that the proportions of suicides from the four different locations did not change over the three time periods (the actual calculations are not needed)? [3 marks]
Is the conclusion in the Abstract justified by the results? [2 marks]

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

Explain why logarithm transformations are our friends, while doing research in econometrics.

Statistics
Paper, Order, or Assignment Requirements

You can answer each question in about two or three sentences. Again, put some thought into your answer but this is not a major essay. Your goal is to show what is going on with the key concepts we have discussed in class so far.

The exam should be done by each individual alone. There is to be no collaboration or collusions with other students. Answer in your own words, you are free to consult notes and the text, but do not cut and paste from the web. As presented in class, the essays may be done with a referral to formulae, but all of the written material in the essay is original, “synthetic” independent work, with no copying of published or unpublished material from the web or any other source.

As you answer each question, please paste in the question number and question before your answer.

1. Explain why logarithm transformations are our friends, while doing research in econometrics.

2. What is the dummy variable trap?

3. What is the Chow test for structural change?

4. Explain the Ramsey RESET test] and explain the intuition of this test as a test of neglected nonlinearity.

5. Explain why the adjusted R-squared is not a particularly useful diagnostic for evaluating the success of a regression specification and result?

6. Explain the Box-Cox procedure for evaluating the linear vs. the logarithmic specification of a regression equation.

7. What is a slope dummy or interactive dummy variable? How does it differ from an intercept dummy?

8. What are the relative merits of the Chow test vs. dummy variables for testing structural change?

9. What are the key econometric issues related to the estimation of VAR models?

10. Explain the Breusch-Godfrey test for detecting autocorrelation or the the Cochrane-Orcutt iterative procedure for eliminating AR(1) autocorrelation

11. Explain the benefits of the Impulse Response function obtained from a VAR model? Explain how confidence intervals can be obtained from bootstrapping?

12. What are the key problems of using VAR’s for obtaining IR and Variance decomposition results?

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

develop a series of poll questions to determine 1) the candidates people will vote for and 2) the issues they care for.

Statistics Project
Paper, Order, or Assignment Requirements

This project is intended for you as the student to use the statistical tools that were introduced
this semester and put together a real example. For this assignment, you will be playing the role
of a hired statistician so explore all possible avenues and tools to answer your employer the
information they seek. As a foundation you will be performing the 5 steps to every stats
problem.
Scenario
You have been hired by a major cable news network to complete a poll for the upcoming
Presidential Primary campaign starting in the Spring. They are interested in finding out how
America is posed to vote in January in the New Hampshire Primary and the Iowa Caucus. Use
the information at the following link as an example/model to build your project.
http://www.foxnews.com/politics/interactive/2015/11/04/fox-poll-gop-nomination-race-cominginto-
focus/
You are tasked to develop a series of poll questions to determine 1) the candidates people will
vote for and 2) the issues they care for. In addition, you will ask your respondents the identify
demographic information of these individuals to include at least five characteristics to include
income, race, education, gender, and political party. When you have developed the poll
questions, you will develop a sampling method to sample 200 individuals.
When you complete the mathematics, who wins in the polls? What is the margin of error?
What conclusions can you make? How do the differences play out between all the
demographics? Use the example poll to answer necessary questions but you are not expected
to provide a full duplication.
Finally, write a 3-5 page report describing your project, how you collected the data, and the
conclusions. All charts and tables will not be included in the 3-5 pages but will be in an
appendix. The format of the report will be 1 inch margins and 1.5 line spacing.

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

You will need to analyse two variables together. Are there differences in monthly bills between users who are on a pre-paid plan and those on a post-paid plan, or are there no significant differences?

Statistics
Paper, Order, or Assignment Requirements

this assignment contains two parts, part 1 and part 2

PART 1

1. Please provide me with a summary of Smartphone Users’ Monthly bills.

You will need to use numerical analysis methods to summarise the sampled users’ monthly bills. This should include relevant summary measures and visualisations.
You will need to write about the averages, variability etc. as in the examples in the relevant lecture worksheet and tutorial.

2. Users can be on a Prepaid or Postpaid plan. Provide a breakdown of monthly bills for each plan type.

You will need to analyse two variables together. Are there differences in monthly bills between users who are on a pre-paid plan and those on a post-paid plan, or are there no significant differences?
Focus on whether there are differences or not. If yes, make sure you describe them. If not, you should explain why you think there are no differences.

3. I understand that most smartphone users tend to use their Smartphone to download entertainment content.Please provide a summary of the most frequently downloaded entertainment content.

You will need to use categorical analysis methods to summarise downloaded entertainment content.
Briefly describe your findings. Examples can be found in the relevant lecture and tutorial.

4. I wish to write about current user satisfaction with their providers.
a. Please advise me on the overall distribution of satisfaction levels. Are users generally satisfied or not?
You will need to use categorical analysis methods to summarise users’ satisfaction with their providers.
Briefly describe your findings. Examples can be found in the relevant lecture and tutorial.
b. Can you provide me with a summary of user satisfaction for male and female users?

You will need to analyse two variables together. Make sure you look at the correct section of your output to see if there is a relationship between the variables, which you will then need to discuss.
Focus on whether there are differences or not. If yes, make sure you report them. If not, describe why not.
5. Please tell me whether factors such as Number of Calls, SMSs, MMSs and the percentage use of their Smartphone for work influence the monthly bill? Are there any factors that stand out as having a greater influence than other factors?

Once more you will need to use techniques for the analysis of two variables. You will need to analyse four (4) relationships, each with Monthly Bill:
• Calls
• SMS
• MMS
• Percent for Work
In each case, the type of the relationship – if there is one – and its direction and strength are of interest.
Make sure you comment on what you find in each case.

• Save your computer analysis frequently (every 10 to 15 minutes).

Part 2: Memorandum(= Your Reply to the Editor’s Memorandum)

You are required to reply to the editor by memorandum, explaining essential information and conclusions from your data analysis. You are allowed no more than two pages to convey your written findings.

• Keep the English simple and the explanations brief.
• Avoid the use of technical statistical terminology. The editor will not necessarily understand
even simple statistical terms: thus your task is to explain your analysis using plain, understandable language.
The memorandum is to be written as a separate document. Thus, you should not have any direct references to your analysis in your memorandum.

• When writing your reply, make sure that you actually provide the information the editor requested in her memorandum to you. That is, answer the editor’s questions.

• Do not refer to or include computer output tables and charts in your response letter.

• Number your answers in your letter 1, 2 a, … etc., to match the sequence of the editor’s requests.

• Include an introduction at the start of the memorandum and a summary/conclusion at the end.

Marks will be deducted for the use of technical terms, poor grammar, awkward sentence structure, poor spelling and punctuation, irrelevant material, poor presentation/organisation and a memorandum that is over two (2) pages long.

When you have completed the memorandum, it is a useful exercise to leave it for a day, return to it and re‐read it as if you knew nothing about the analysis. Does it flow easily? Does it make sense? Can someone without prior knowledge follow your written conclusions? Often on re‐reading, you become aware that you have made some points in a clumsy manner and you find that you can re‐phrase them much more clearly.

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"

The formula for a regression equation is Y’ = 2X + 9. What would be the predicted score for a person scoring 6 on X? If someone’s predicted score was 14, what was this person’s score on X?

Statistics
Paper, Order, or Assignment Requirements

The formula for a regression equation is Y’ = 2X + 9.
What would be the predicted score for a person scoring 6 on X?
If someone’s predicted score was 14, what was this person’s score on X?

For the X,Y data below, compute:
r and determine if it is significantly different from zero.
the slope of the regression line and test if it differs significantly from zero.
the 95% confidence interval for the slope.
X y
4 6
3 7
5 12
11 17
10 9
14 17

At a school pep rally, a group of sophomore students organized a free raffle for prizes. They claim that they put the names of all of the students in the school in the basket and that they randomly drew 36 names out of this basket. Of the prize winners, 6 were freshmen, 14 were sophomores, 9 were juniors, and 7 were seniors. The results do not seem that random to you. You think it is a little fishy that sophomores organized the raffle and also won the most prizes. Your school is composed of 30% freshmen, 25% sophomores, 25% juniors, and 20% seniors.
What are the expected frequencies of winners from each class?
Conduct a significance test to determine whether the winners of the prizes were distributed throughout the classes as would be expected based on the percentage of students in each group. Report your Chi Square and p values.
What do you conclude?

A geologist collects hand-specimen sized pieces of limestone from a particular area. A qualitative assessment of both texture and color is made with the following results. Is there evidence of association between color and texture for these limestones? Explain your answer.
colour
texture Light Medium Dark
Fine 4 20 8
medium 5 23 12
course 21 23 4

True or False ? The standard deviation of the chi-square distribution is twice the mean.

Do men and women select different breakfasts? The breakfasts ordered by randomly selected men and women at a popular breakfast place is shown. Conduct a test for homogeneity at a 5% level of significance
H0: _______
Ha: _______
In words, clearly state what your random variable X ¯ 1− X ¯2, P′1−P′2 or X ¯d represents.
State the distribution to use for the test.
What is the test statistic?
What is the p-value? In one to two complete sentences, explain what the p-value means for this problem Indicate the correct decision (“reject”or“do not reject” the null hypothesis), the reason for it, and write an appropriate conclusion, using complete sentences.
Alpha: _______
Decision: _______
Reason for decision: _______
Conclusion: _______ .
In complete sentences, explain how you determined which distribution to use.

Suppose an airline claims that its flights are consistently on time with an average delay of at most 15 minutes. It claims that the average delay is so consistent that the variance is no more than 150 minutes. Doubting the consistency part of the claim, a disgruntled traveler calculates the delays for his next 25 flights. The average delay for those 25 flights is 22 minutes with a standard deviation of 15 minutes. df= ________

Suppose an airline claims that its flights are consistently on time with an average delay of at most 15 minutes. It claims that the average delay is so consistent that the variance is no more than 150 minutes. Doubting the consistency part of the claim, a disgruntled traveler calculates the delays for his next 25 flights. The average delay for those 25 flights is 22 minutes with a standard deviation of 15 minutes. Letα= 0.05 Decision: ________ Conclusion (write out in a complete sentence.): ________

Can a coefficient of determination be negative? Why or why not?

Size (ounces) Cost Cost per ounce
6 3.99
32 4.99
64 5.99
200 10.99

Using “size” as the independent variable and “cost” as the dependent variable, draw a scatter plot.

Does it appear from inspection that there is a relationship between the variables? Why or why not?
Calculate the least-squares line. Put the equation in the form of: ŷ=a+bx
Find the correlation coefficient. Is it significant?
If the laundry detergent were sold in a 40-ounce size, find the estimated cost.
If the laundry detergent were sold in a 90-ounce size, find the estimated cost.
Does it appear that a line is the best way to fit the data? Why or why not?
Are there any outliers in the given data?
Istheleast-squareslinevalidforpredictingwhata300-ouncesizeofthelaundrydetergentwouldyoucost?Why or why not?
What is the slope of the least-squares (best-fit) line? Interpret the slope

 

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"