Take the log of all the data points to obtain the following table:
Logs | |||||||
4.074634 | 3.954194 | 3.532117 | 3.785187 | 2.641474 | 3.372544 | 2.499687 | 2.955207 |
3.103119 | 3.877544 | 2.699838 | 2.429752 | 3.878694 | 4.512698 | 2.004321 | 2.064458 |
2.647383 | 3.461348 | 3.431203 | 2.053078 | 2.996512 | 4.040761 | 2.633468 | 3.031408 |
2.969882 | 2.749736 | 2.357935 | 3.127753 | 2.568202 | 1.579784 | 4.07613 | 4.217826 |
4.6728 | 4.030559 | 3.72583 | 2.539076 | 1.662758 | 3.686904 | 2.146128 | 2.155336 |
2.079181 | 3.688331 | 2.68842 | 2.901458 | 2.514548 | 2.677607 | 2.888741 | 1.748188 |
4.581574 | 3.354876 | 2.613842 | 1.845098 | 3.851136 | 3.212188 | 3.820858 | 3.302114 |
2.55145 | 3.767304 | 4.998011 | 2.861534 | 3.0306 | 2.238046 | 3.008174 | 3.952841 |
2.994757 | 2.810233 | 2.176091 | 3.60401 | 3.379668 | 4.243881 | 1.724276 | 2.33646 |
2.133539 | 3.55279 | 3.443263 | 3.680154 | 3.054613 | 2.958086 | 1.491362 | 3.6861 |
Now, using bins of size 0.5 from 0 to 5, we get the following histogram for the data in Excel (Data -> Data Analysis -> Histogram, choose the above table data range as input and these bins as the bin range):
The logarithm of the data indeed looks like following a normal distribution as is evident from the shape of the curve above.
This implies that the original data given follows a lognormal distribution.
Let's call the random variable generating the log data above Y and the original data X.
Calculating the mean of X from data, we get Mean μ = 3.067, and Std Dev σ = 0.796
Hence, X ~ N(μ, σ2) ~ N(3.067, 0.7962)
Y is lognormal, Y has Mean μY = e() = e(3.067 + 1/2*0.796^2) = 29.48
And, Variance, σY2 = (e() - 1) * e
= (1.884 - 1) * 869 = 768.25
Std Dev, σY = 27.72
Exercise 2 Consider the following observations. 1261,11972,3664,1474, 3590, 11171,25,1697,3571,23...
Given the enclosed set of fifty observations of Weight of Coating of Galvanized Iron Sheets (oz/ft?) It is desired to perform the following determinations 1) Estimate the population parameters mean and standard deviation using the method of moments Form two histograms of the fifty observations. One showing the frequency of observations, and the second showing the probability density in each class interval 2) 3) Draw this last histogram together with the PDF of the normal distribution for the mean and...
Consider the following frequency table of observations on the random variable X. Values 0 1 2 3 4 5 Observed Frequency 8 25 22 21 16 8 (a) Based on these 100 observations, is a Poisson distribution with a mean of 2.4 an appropriate model? Perform a goodness-of-fit procedure with α=0.05. Which of the following is the correct conclusion? (b) Which of the following are the correct bounds on the P-value for this test.
Consider the following four observations of a process of interest: 89 24 9 50 You are trying to decide whether an exponential or a uniform distribution would be a better fit for the data. a) Develop Q-Q plots for the exponential and uniform distributions, using the data to estimate any parameters you need (NOTE: Your graphs do not need to be perfectly to scale, but they should be readable and you need to compute the graph value b) Which distribution...
Please explain if any of my answers are wrong. Thank you. Suppose a statistician wishes to test whether a large number of observations Xi follows an exponential distribution with parameter A = 1. He wishes to test this hypothesis exactly, and intends that if the observations follow an exponential distribution with a different parameter the test should reject the null hypothesis given sufficiently many observations. In addition, he wants to have a numeric statistic that he could report and does...
Question 1 Consider the following Multiple Regression Model yı BoB1B2 + El, y2 BIB2E2 y3 B2Es, and y4 Bo+BI4 Suppose that & 's are independent and identically distributed N(0, o2 ) a) Write down the model in the matrix form b) Show that 2 2 1 X'X2 3 2 1.67 -1.33 0.33 (X'X) 1.67 Note that -1.33 -0.67 1 2 3 0.33 -0.67 0.67 c) Find unbiased estimators for Bo, Bi, and B2 given that y 3, y2 1, y3-...
Need help with stats true or false questions Decide (with short explanations) whether the following statements are true or false a) We consider the model y-Ao +A(z) +E. Let (-0.01, 1.5) be a 95% confidence interval for A In this case, a t-test with significance level 1% rejects the null hypothesis Ho : A-0 against a two sided alternative. b) Complicated models with a lot of parameters are better for prediction then simple models with just a few parameters c)...
2. Goodness-of-fit tests Normal population Aa Aa Manufacturing processes, such as coin minting, are subject to small variations due to variations in materials, temperature, and humidity. The variations in materials, temperature, and humidity from their norms are just as likely to be positive as negative but are more likely to be small than large. Consider the 1 coin issued by Belgium. Realizing that there are small variations in the minting process and random error in the weighing process, it might...
SOME DRAWBACKS OF BLACK-SCHOLES To provide one motivation for the development of ARCH models (next handout), we briefly discuss here some difficulties associated with the Black Scholes formula, which is widely used to calculate the price of an option. For example, consider a European call option for a stock. This is the right to buy a specific number of shares of a specific stock on a specific date in the future, at a specific price (the exercise price, also called...
Consider a multiple regression model of the dependent variable y on independent variables x1, X2, X3, and x4: Using data with n 60 observations for each of the variables, a student obtains the following estimated regression equation for the model given: y0.35 0.58x1 + 0.45x2-0.25x3 - 0.10x4 He would like to conduct significance tests for a multiple regression relationship. He uses the F test to determine whether a significant relationship exists between the dependent variable and He uses the t...
4. Testing for significance Aa Aa Consider a multiple regression model of the dependent variable y on independent variables x1, x2, X3, and x4: Using data with n = 60 observations for each of the variables, a student obtains the following estimated regression equation for the model given: 0.04 + 0.28X1 + 0.84X2-0.06x3 + 0.14x4 y She would like to conduct significance tests for a multiple regression relationship. She uses the F test to determine whether a significant relationship exists...