The data set 401KSUBS.RAW contains information on net financial wealth (nettfa), age of the survey respondent (age), annual family income (inc), family size (fsize), and participation in certain pension plans for people in the United States. The wealth and income variables are both recorded in thousands of dollars. For this question, use only the data for single-person households (so fsize = 1).
(i) How many single-person households are there in the data set?
(ii) Use OLS to estimate the model
and report the results using the usual format. Be sure to use only the single-person households in the sample. Interpret the slope coefficients. Are there any surprises in the slope estimates?
(iii) Does the intercept from the regression in part (ii) have an interesting meaning? Explain.
(iv) Find the p-value for the test H0: ft2 = 1 against H0: ft2<1. Do you reject H0 at the 1% significance level?
(v) If you do a simple regression of nettfa on inc, is the estimated coefficient on inc much different from the estimate in part (ii)? Why or why not?
(i)
In the referred data, the variable f size is used for family size. Single person family is denoted by 1. Import the data in MS-Excel and use filter to obtain the number of single person family. Therefore, there aresingle people in the sample of 9,275.
(ii)
The data is shown below:
The estimated model is written below:
The required estimated model can be obtained in Minitab by using below steps:
1. Go to Stat > Regression > Regression as shown in the below screenshot:
2. The below dialog box opens. Enter the required variables corresponding to “Response” and “Predictors” headings. The updated dialog box is shown below:
3. Click OK to get the below output:
The estimated equation is:
And:
The coefficient on inc represents that extra single dollar in income is imitated in about 80 more cents in estimated nettfa. The coefficient on age means that, if an individual becomes older by one year, his/her nettfa is estimated to increase by about $843. Both interpretations are not surprising.
(iii)
The intercept obtained above does not seems to be interesting because it provides the estimated nettfa for inc = 0 and age = 0. It can conclude that there is none of them even close to these values in the appropriate population.
(iv)
From the output obtained in part (ii), it can conclude that the p-value for ageis 0.000 that is less than the considered level of significance 0.01. Therefore, the null hypothesis may be rejected at 1% level of significance.
(v)
The estimated model is written below:
The required estimated model can be obtained by using below steps:
1. Go to Stat > Regression > Regression as shown in the below screenshot:
2. The below dialog box opens. Enter the required variables corresponding to “Response” and “Predictors” headings. The updated dialog box is shown below:
3. Click OK to get the below output:
The coefficient on inc in the simple regression output obtained above is.821 that is not very dissimilar from the .799 obtained in part (ii).
The data set 401KSUBS.RAW contains information on net financial wealth (nettfa), age of the survey respondent...
Use the Age data set and the GPA data set from the Class Survey spreadsheet to complete this project. (1 points) By submitting this completed project for grading, you are agreeing that you worked independently on this project and the work is yours. If you are found to have worked with anyone or copied another student's wo you will be given a zero on this project Question: Is there a relationship between a student's age(x), in years, and their Grade...
10. You have a data set that contains information about individuals gender, the number of children they have, their family income, and whether they are in the labor force You estimate the following linear probability model: Plaborforcel po+B1 children +B2 female +B3 (children x semale)+u a.) In terms of the model's parameters, what is the marginal effect of having an additional child on a woman's probability of being in the labor force? What is the marginal effect of having an...
Use the data in COUNTYMURDERS to answer this questions. Use only the data for 1996. (i) How many counties had zero murders in 1996? How many counties had at least one execution? What is the largest number of executions? (ii) Estimate the equation murders = b0 + b1execs + u by OLS and report the results in the usual way, including sample size and R-squared. (iii) Interpret the slope coefficient reported in part. Does the estimated equation suggest a deter-...
Question 1 Question Type 1 The following data sets each contain 3 random observations of two variables. For each data set, answer the following questions: Question (a) The data below is a random sample of 3 observations drawn from the United States population. Use the data to answer the following questions i. Find 95% confidence intervals of the population mean of experience and wage ii. Estimate pe,w, the correlation between the variables experience and wage. iii. Find Bı and Po,...
The data set consists of information on 4900 full-time full-year workers. The highest educational achievement for each worker was either a high school diploma or a bachelor's degree. The worker's ages ranged from 25 to 45 years. The data set also contained information on the region of the country where the person lived, marital status, and number of children. For the purposes of these exercises, let AHE = average hourly earnings (in 2005 dollars) College = binary variable (1 if...
Exercise 2. [Data analysis, requires R] For this questions use the bac data set from the openintro library. To access this data set first install the package using install.packages ("openintro") (this only needs to be done once). Then load the pack- age into R with the command library(openintro). You can read about this data set in the help menu by entering the command ?openintro or help(openintro). Many people believe that gender, weight, drinking habits, and many other factors are much...
Table 1: 2012 Current Population Survey Data Dependent Variable: Av Hourly Eaming 8.31 0.2 3.85 0.2 8.34 8.32 0.22 3.81 0.22 0.51 0.04 College (X) Female (X2) Age (X3) Northeast (x) Midwest (L.) South (Xs) Intercept 3.80 0.52 04 0.18 36 1.23 31 0.43 30 2.05 17.02 1.87 Forthis question, refer to the table of estimated regressions in Table 1, computed using data from the 2012 CPS. The dataset contains information on 7,440 full-time, full-year workers. The highest educational achievement...
I've lost count how many times I posted this question, at least 4 or 5. Can someone please help me out 1. In her paper on measuring the returns to high school sports, Betsey Stevenson investigates the causal implications of expansion in female sports participation caused by Title IX. Compliance with Title IX can be characterized as requiring a school to raise its female athletic participation rate to near equality with its male athletic participation rate. The paper is here:...
In an earlier tutorial, you were introduced to a data set that contains voting information for 173 US election districts. Suppose we have access to 40 observations from that data set. Throughout this assignment, data analysis is conducted on these 40 observations only The variables are defined as folloWS: xotedpercentage of the vote received by Candidate A expendA-campaign expenditure by Candidate A (in million dollars) expendB-campaign expenditure by Candidate B (in million dollars) grtvstra -a measure of party strength for...
-+ Ft to page Page view 2. A doctor wanted to determine whether there is a Age versus HDL Cholesterol relation between a male's age and his HDL (so-called good) cholesterol. He randomly selected 17 of his patients and determined their HDL cholesterol. He obtained the following data: (Use Minitab only for graphs. Show all the steps) HDL HDL Age, x Cholesterol,y Age, x Cholesterol, y 38 42 46 32 57 54 34 56 35 40 42 38 47 38...