Question

USE R STUDIO The stackloss data frame available in R contains 21 observations on four variables...

USE R STUDIO

The stackloss data frame available in R contains 21 observations on four variables taken at a factory where ammonia is converted to nitric acid. The first three variables are Air.Flow, Water.Temp, and Acid.Conc. The fourth variable is stack.loss, which measures the amount of ammonia that escapes before being absorbed. Read the help file for more information about this data frame.

- Give a numerical summarization of each column of the dataset, then use boxplots to help illustrating the summarization.

- Use a pairwise scatterplot to explore possible relationships between acid concentration, water temperature, and air flow. Is there a linear relationship between any pair of variables? Write it as a comment in your R script file.

- Draw scatter plot the fourth variable versus each of the first three variables. Discover if there is any linear relationship between the acid concentration, the water temperature, the air flow and the amount of ammonia that escapes before being absorbed. Write it as a comment in your R script file.

- From the above part, you may discover multiple scatter plots indicating linear trends. Choose one of them (Air.Flow, Water.Temp, or Acid.Conc) as explanatory variable and build a simple linear regression model based on this variable and the response (stack.loss). You need to include at least the following in your code:

– Scatter plot with the linear regression line on the data.

– Write as comments the interpretation of the linear regression coefficients.

– Check for normality of residuals using QQ-plot. Write some comments on the normality check.

– Check constant variance of error using the residual plot. Write some comments on the variance of residuals.

– Check the goodness of fit by extracting and reporting the R2 value. Write some comments on the goodness of fit.

1 0
Add a comment Improve this question Transcribed image text
Answer #1

> data <- stackloss > # 1) Numerical summarization of each column of the dataset > summary (data) Air.Flow Min. Water.Temp Mi> # 4) > # a) > plot (data$Water.Temp , data$stack.loss) > abline (1m (data$stack.loss~data$water.Temp)) > # there is a linea> # e) > summary(regression) Call: data$Air.Flow) 1m (formula = data$stack.loss Residuals: Min Median 3Q 1.1166 Max 1Q -1.127Note : just run the given rcode and plot the graphs.

Add a comment
Know the answer?
Add Answer to:
USE R STUDIO The stackloss data frame available in R contains 21 observations on four variables...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • Consider the following data for two variables, x and y.

     Consider the following data for two variables, x and y. a. Choose the correct scatter diagram with x and y. The correct scatter diagram is - _______ . Does there appear to be a linear relationship between x and y? Explain. The scatter diagram- Select your answer - some evidence of a possible linear relationship. b. Develop the estimated regression equation relating x and y. Save "predicted" and "residuals" (to 4 decimals).  c. Choose the correct scatter diagram or the residuals versus y tor the estimated...

  • Exercise 1. For this exercise use the bdims data set from the openintro package. Type ?bdims to r...

    Exercise 1. For this exercise use the bdims data set from the openintro package. Type ?bdims to read about this data set in the help menu. Of interest are the variables hgt (height in centimeters), wgt (weight in kilograms), and sex (dummy variable with 1-male, 0-female). Since ggplotO requires that a categorical variable be coded as a factor type in R, run the following code: library (openintro) bdíms$sex2 <-factor (bdins$sex, levels-c (0,1), labels=c('F', 'M')) (a) Use ggplot2 to make a...

  • Please provide with R codes! thank you!! Data: Question: Data: 179 161 162 605557 155 60...

    Please provide with R codes! thank you!! Data: Question: Data: 179 161 162 605557 155 60 158 56 172 57 191 60 179 57 163 58 Height (cm) Head Circumference (cm) 2. Draw at most 3 plots to visually describe your data. Is your response variable approximately Normal? 3. Numerically describe the centre, spread and any unusual points of your variables/data. 4. Fit and describe a simple linear regression model between head circumference and height. 5. Are the regression parameters...

  • Please help me with these questions with R codes.. thank you!! Here’s the data I have...

    Please help me with these questions with R codes.. thank you!! Here’s the data I have obtained for the questions: Data: 9 students in total Height(cm) Head Circumference(cm) 179 60 161 55 162 57 155 60 158 56 172 57 191 60 179 57 163 58 2. Draw at most 3 plots to visually describe your data. Is your response variable approximately Normal? 3. Numerically describe the centre, spread and any unusual points of your variables/data. 4. Fit and describe...

  • here is the data Y X 34.38 22.06 30.38 19.88 26.13 18.83 31.85 22.09 26.77 17.19...

    here is the data Y X 34.38 22.06 30.38 19.88 26.13 18.83 31.85 22.09 26.77 17.19 29.00 20.72 28.92 18.10 26.30 18.01 29.49 18.69 31.36 18.05 27.07 17.75 31.17 19.96 27.74 17.87 30.01 20.20 29.61 20.65 31.78 20.32 32.93 21.37 30.29 17.31 28.57 23.50 29.80 22.02 this is q1 2. Using a computer program, a data set with 20 observations on and was generated (GENERATEDATA.csv). Again, use the same instructions as above (in #1) to upload and import this dataset...

  • Please make sure you write the R cude, R-PROGRAM Data was collected in order to investigate...

    Please make sure you write the R cude, R-PROGRAM Data was collected in order to investigate the effect of temperature on the yield of a chemical process. Temperatures were set at eight specific levels, and the yield was determined for five replications at each evel 4. a. Identify the response variable and the predictor variable. Explain your reasoning b. Describe the two sources of variation (variation due to regression and variation about the regression) in terms of this particular problem...

  • The Book of R (Question 20.2) Please answer using R code. Continue using the survey data...

    The Book of R (Question 20.2) Please answer using R code. Continue using the survey data frame from the package MASS for the next few exercises. The survey data set has a variable named Exer , a factor with k = 3 levels describing the amount of physical exercise time each student gets: none, some, or frequent. Obtain a count of the number of students in each category and produce side-by-side boxplots of student height split by exercise. Assuming independence...

  • **R-STUDIO KNOWLEDGE REQUIRED*** PLEASE ANSWER THE FOLLOWING WITH ****R-STUDIO**** CODING- thank ...

    **R-STUDIO KNOWLEDGE REQUIRED*** PLEASE ANSWER THE FOLLOWING WITH ****R-STUDIO**** CODING- thank you so much!! I am specifically look for the solution to part ***(h)**** and *****(i)***** below using R-Studio code: The data set in question is: YEAR Height Stories 1990 770 54 1980 677 47 1990 428 28 1989 410 38 1966 371 29 1976 504 38 1974 1136 80 1991 695 52 1982 551 45 1986 550 40 1931 568 49 1979 504 33 1988 560 50 1973 512...

  • A group of physics students collected data from a test of the projectile motion problem that...

    A group of physics students collected data from a test of the projectile motion problem that was analyzed in a previous lab exercise (L5). In their test, the students varied the angle and initial velocity Vo at which the projectile was launched, and then measured the resulting time of flight (tright). Note that tright was the dependent variable, while and Vo were independent variables. The results are listed below. (degrees) Time of Flight (s) Initial Velocity V. (m/s) 15 20...

  • Solve the following problem in R and print out the commands and outputs. 3. The data...

    Solve the following problem in R and print out the commands and outputs. 3. The data set fancy (you need to library the fpp package to get the dataset) concern the monthly sales figures of a shop which opened in January 1987 and sells gifts, souvenirs, and novelties. The sales volume varies with the seasonal population of tourists. (a) Produce a time plot of the data and describe the patterns in the graph. Identify any unusual or unexpected fluctuations in...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT