Question

Regression analysis is an important statistical method for the analysis of business data. It enab...

Regression analysis is an important statistical method for the analysis of business data. It enables the identification and characterization of relationships among factors and enables the identification of areas of significance.

The performance and interpretation of linear regression analysis are subject to a variety of pitfalls. Comment on what these pitfalls may be and how you would avoid them. Use an example if it helps to clarify the point.

0 0
Add a comment Improve this question Transcribed image text
Answer #1

Pitfalls and solutions for regression analysis

We often have information on two numeric characteristics for each member of a group and believe that these are related to each other – i.e. values of one characteristic vary depending on the values of the other. For instance, in a recent study, researchers had data on body mass index (BMI) and mid-upper arm circumference (MUAC) on 1373 hospitalized patients, and they decided to determine whether there was a relationship between BMI and MUAC.[1] In such a situation, as we discussed in a recent piece on “Correlation” in this series,[2] the researchers would plot the data on a scatter diagram. If the dots fall roughly along a straight line, sloping either upwards or downwards, they would conclude that a relationship exists. As a next step, they may be tempted to ask whether, knowing the value of one variable (MUAC), it is possible to predict the value of the other variable (BMI) in the study group. This can be done using “simple linear regression” analysis, also sometimes referred to as “linear regression.” The variable whose value is known (MUAC here) is referred to as the independent (or predictor or explanatory) variable, and the variable whose value is being predicted (BMI here) is referred to as the dependent (or outcome or response) variable. The independent and dependent variables are, by convention, referred to as “x” and “y” and are plotted on horizontal and vertical axes, respectively.

At times, one is interested in predicting the value of a numerical response variable based on the values of more than one numeric predictors. For instance, one study found that whole-body fat content in men could be predicted using information on thigh circumference, triceps and thigh skinfold thickness, biceps muscle thickness, weight, and height.[3] This is done using “multiple linear regression.” We will not discuss this more complex form of regression.

Although the concepts of “correlation” and “linear regression” are somewhat related and share some assumptions, these also have some important differences, as we discuss later in this piece.

ASSUMPTIONS

Regression analysis makes several assumptions, which are quite akin to those for correlation analysis, as we discussed in a recent issue of the journal.[1] To recapitulate, first, the relationship between x and y should be linear. Second, all the observations in a sample must be independent of each other; thus, this method should not be used if the data include more than one observation on any individual. Furthermore, the data must not include one or a few extreme values since these may create a false sense of relationship in the data even when none exists. If these assumptions are not met, the results of linear regression analysis may be misleading.

CORRELATION VERSUS REGRESSION

Correlation and regression analyses are similar in that these assess the linear relationship between two quantitative variables. However, these look at different aspects of this relationship. Simple linear regression (i.e., its coefficient or “b”) predicts the nature of the association – it provides a means of predicting the value of dependent variable using the value of predictor variable. It indicates how much and in which direction the dependent variable changes on average for a unit increase in the latter. By contrast, correlation (i.e., correlation coefficient or “r”) provides a measure of the strength of linear association – a measure of how closely the individual data points lie on the regression line. The values of “b” and “r” always carry the same sign – either both are positive or both are negative. However, their magnitudes can vary widely. For the same value of “b,” the magnitude of “r” can vary from 1.0 to close to 0.

ADDITIONAL CONSIDERATIONS

Some points must be kept in mind when interpreting the results of regression analysis. The absolute value of regression coefficient (“b”) depends on the units used to measure the two variables. For instance, in a linear regression equation of BMI (independent) versus MUAC (dependent), the value of “b” will be 2.54-fold higher if the MUAC is expressed in inches instead of in centimeters (1 inch = 2.54 cm); alternatively, if the MUAC is expressed in millimeters, the regression coefficient will become one-tenth of the original value (1 mm = 1/10 cm). A change in the unit of “y” will also lead to a change in the value of the regression coefficient. This must be kept in mind when interpreting the absolute value of a regression coefficient.

Similarly, the value of “intercept” also depends on the unit used to measure the dependent variable. Another important point to remember about the “intercept” is that its value may not be biologically or clinically interpretable. For instance, in the MUAC-BMI example above, the intercept was −0.042, a negative value for BMI which is clearly implausible. This happens when, in real-life, the value of independent variable cannot be 0 as was the case for the MUAC-BMI example above (think of MUAC = 0; it simply cannot occur in real-life).

Furthermore, a regression equation should be used for prediction only for those values of the independent variable that lie within in the range of the latter's values in the data originally used to develop the regression equation.

Add a comment
Know the answer?
Add Answer to:
Regression analysis is an important statistical method for the analysis of business data. It enab...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • regression analysis is an important statistical method for the analysis of business data. It enables the...

    regression analysis is an important statistical method for the analysis of business data. It enables the identification and characterization of relationships among factors and enables the identification of areas of significance. The performance and interpretation of multiple linear regression analysis is subject to a variety of pitfalls similar to simple linear regression. Comment on additional pitfalls when analyzing multiple factors and how you would avoid them. Use an example if it helps to clarify the point.

  • In statistical modeling, regression analysis helps you to: a. None of them b. estimate the relationships...

    In statistical modeling, regression analysis helps you to: a. None of them b. estimate the relationships between two dependent variables and one independent variable. c. estimate the relationships between a dependent variable and one or more independent variables. d. calculate the exact values for the dependent and independent variables.

  • Regression analysis is a statistical technique that fits a line to observed data points so that...

    Regression analysis is a statistical technique that fits a line to observed data points so that the resulting equation can be used to forecast other data points. It is useful in excess capacity and economies of scale situations. Economies of scale occur when the ratio of asset to sales will change as the size of the firm increases. Regression analysis can lead to improved financial forecasts and better information which can be used to improve management's actions. Quantitative Problem: Jasper...

  • 2. The following data were collected last semester on ten students. Complete a multiple regression analysis in which you use AGE (A), MATH PROFICIENCY (MP) (on a 1 –10 scale), and GENDER (G) (0 = male...

    2. The following data were collected last semester on ten students. Complete a multiple regression analysis in which you use AGE (A), MATH PROFICIENCY (MP) (on a 1 –10 scale), and GENDER (G) (0 = male, 1 = female) as predictors of FINAL EXAM (FE) performance. Do this analysis in SPSS and then answer the following questions. Subject # A MP G FE 1 35 8 1 90 2 31 6 0 88 3 26 5 1 84 4 33...

  • Regression and Correlation Methods: Correlation, ANOVA, and Least Squares This is another way of assessing the...

    Regression and Correlation Methods: Correlation, ANOVA, and Least Squares This is another way of assessing the possible association between a normally distributed variable y and a categorical variable x. These techniques are special cases of linear regression methods. The purpose of the assignment is to demonstrate methods of regression and correlation analysis in which two different variables in the same sample are related. The following are three important statistics, or methodologies, for using correlation and regression: Pearson's correlation coefficient ANOVA...

  • Data Analysis is designed to build a strong foundation for development of statistical literacy and statistical...

    Data Analysis is designed to build a strong foundation for development of statistical literacy and statistical thinking. Both are essential for business success and further studies in business. Recent research showed that “On average for 15-year-old Australian students, females achieved at a significantly lower level than male students (in mathematics)” (p.2, Buckley 2016) 1 . However, there is also research evidence that “Despite the stereotype that boys do better in math and science, girls have made higher grades than boys...

  • Case Study Demand Forecasting For an organization to provide customer delight it is important that organization...

    Case Study Demand Forecasting For an organization to provide customer delight it is important that organization can understand what customer wants and how much do they want. If an organization can gauge future demand that manufacturing plan becomes simpler and cost-effective. The process of analyzing and understanding current and past information to understand future patterns through a scientific and systemic approach is called forecasting. And the process of estimating the future demand of the product in terms of a unit...

  • Why are networks and industry relationships important to TWC? What other strategies could an eco-tourism business...

    Why are networks and industry relationships important to TWC? What other strategies could an eco-tourism business of this size use to source ideas and incorporate into its new product development strategy? Tasmanian Walking Company: Balancing luxury and adventure in a sustainable experience Gemma Lewis, PhD University of Tasmania, Australia the organic skincare range supplied by LITYA (Li'tya, 2016). Before introducing this new activity, TWC had to adapt certain treatments to ensure they maintained ocus on sustainable resaurce usage. Their outecor...

  • (I did this homework in completion but professor was not happy with answers whatsoever, need additional...

    (I did this homework in completion but professor was not happy with answers whatsoever, need additional answers and especially improvement to 1.b help!! photos not attaching? mean by severai steps. inis is a View Feedback homework and will need you to work, in one two View Feedback or various steps. Unfortunately, I cannot read your screen shot of what you did on excel. As I have said in numerous messages announcements etc, I cannot аcсept pictures. You need to write...

  • Selecting appropriate assessment techniques l: high quality assessments For an assessment to be h...

    5-7 sentences initial thoughts Selecting appropriate assessment techniques l: high quality assessments For an assessment to be high quality it needs to have good validity and reliability as well as absence from bias. Validity Validity is the evaluation of the “adequacy and appropriateness of the interpretations and uses of assessment results for a given group of individuals (Linn & Miller, 2005, p.68). For example is it appropriate to conclude that the results of a mathematics test on fractions given to...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT