PYTHON import numpy as np import pandas as pd import matplotlib.pyplot as plt from sklearn.linear_model import...

Question

Question

PYTHON

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

Our goal is to create a linear regression model to estimate values of ln_price using ln_carat as the only feature. We will now prepare the feature and label arrays.

Our goal is to create a linear regression model to estimate values of In_price using In_carat as the only feature. We will no We will now calculate the r-squared score for the model on both the training set and the test set. Calculate and print the tr

"carat"   "cut" "color"   "clarity"   "depth"   "table"   "price"   "x"   "y"   "z"
"1" 0.23   "Ideal" "E" "SI2" 61.5 55 326   3.95   3.98   2.43
"2" 0.21   "Premium" "E" "SI1" 59.8 61 326   3.89   3.84   2.31
"3"   0.23   "Good" "E" "VS1" 56.9 65 327   4.05   4.07   2.31
"4"   0.29   "Premium" "I" "VS2" 62.4 58 334   4.2 4.23   2.63
"5"   0.31   "Good" "J" "SI2" 63.3 58 335   4.34   4.35   2.75
"6"   0.24   "Very Good"   "J" "VVS2"   62.8 57 336   3.94   3.96   2.48
"7"   0.24   "Very Good"   "I" "VVS1"   62.3 57 336   3.95   3.98   2.47
"8"   0.26   "Very Good"   "H" "SI1" 61.9 55 337   4.07   4.11   2.53
"9"   0.22 "Fair" "E" "VS2" 65.1 61 337   3.87 3.78   2.49
"10"   0.23   "Very Good"   "H"   "VS1" 59.4 61 338   4 4.05 2.39

engineering Computer-Science

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

I have written comments for the code. Please go through them

The data points given are less so for splitting i have used 0.2 percentage please change that 0.1 while you are using the script i have included the line as comment just uncomment it and use it when you have enough data. I have mentioned in the code as well please go through that.

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

df = pd.read_csv('data.csv') # Read as csv file 

X2 = np.log(df['carat'].to_numpy().reshape(df.shape[0],1)) # converting carat to ln_carat and numpy  2d array
y = np.log(df['price'].to_numpy()) # converting price to ln_price and numpy 1d array

# I have used 0.2 test size here because there are only 10 data points are given and to calculate r-squared error 
# we need atleast two data points in test data set. So please delete this line and use the commented line when you have enough data
X_train_2, X_test_2, y_train_2, y_test_2 = train_test_split(X2, y, test_size=0.2, random_state = 1)
# X_train_2, X_test_2, y_train_2, y_test_2 = train_test_split(X2, y, test_size=0.1, random_state = 1) 

print("Training Features Shape: {}".format(X_train_2.shape))
print("Test Features Shape:     {}".format(X_test_2.shape))

dia_mod = LinearRegression() # Initalizing linear regression Model
dia_mod.fit(X_train_2, y_train_2) # fitting data with linear regression model.

print("Intercept:    {}".format(dia_mod.intercept_))
print("Coefficients: {}".format(dia_mod.coef_))


print('Training r-Squared: {}'.format(dia_mod.score(X_train_2,y_train_2)))
print('Testing r-Squared:  {}'.format(dia_mod.score(X_test_2,y_test_2)))

test_pred_2 = dia_mod.predict(X_test_2)
print("Observed Prices:  {}".format(np.exp(y_test_2))) # converting natural log values using exponential exp(log_base_e(2)) = 2
print("Estimated Prices: {}".format(np.exp(test_pred_2).round())) # converting natural log values using exponential exp(log_base_e(2)) = 2

diamonds_new = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0] # given new list
diamonds_new = np.array(diamonds_new).reshape(len(diamonds_new),1) # conver to 2d column array 

new_pred_2 = dia_mod.predict(diamonds_new) # predicting using model
new_pred_3 = np.exp(new_pred_2) # converting natural log to normal values using np.exp
print("Esitmated Prices: {}".format(new_pred_3.round())) # round to nearest whole number/ prices.

Output Snippets:

-3]: import numpy as np import pandas as pd import matplotlib.pyplot as plt from sklearn.linear_model import LinearRegression

$: print(Training Features Shape: {}.format(X_train_2. shape)) print(Test Features Shape: {}.format(X_test_2. shape)) Trai$

1: dia_mod LinearRegression () # Initalizing Linear regression Model : dia_mod. fit(X_train_2, y_train_2) # fitting data with

- : diamonds_new [0.5, 1.0, 1.5, 2.0, 2.5, 3.0] # given new list diamonds_new = np.array(diamonds_new).reshape(len (diamonds_ Please drop a comment for any further doubts.

Add a comment

Answer 2

PYTHON import numpy as np import pandas as pd import matplotlib.pyplot as plt from sklearn.linear_model import...

Homework Answers

Add Answer to:
PYTHON import numpy as np import pandas as pd import matplotlib.pyplot as plt from sklearn.linear_model import...

Post as a guest

Earn Coins

This is my code: import numpy as np import pandas as pd import sys from keras.models...

Python 1 import matplotlib.pyplot as plt 2 import numpy as np 3 4 abscissa = np.arange(20) 5 plt....

Before you start For this homework, we will need to import some libraries. You need to...

package homework; import java.util.Arrays; import stdlib.*; /** CSC300Homework4 version 1.0 * * * * Find...

In Problem Set 7 you designed and implemented a Message class. This time, let's design and...

What are the major areas of change from the old design to the new design? What...

Assignment Predator / Prey Objectives Reading from and writing to text files Implementing mathematical formulas in...

How can we assess whether a project is a success or a failure? This case presents...

I need Summary of this Paper i dont need long summary i need What methodology they used , what is the purpose of this p...

PYTHON import numpy as np import pandas as pd import matplotlib.pyplot as plt from sklearn.linear_model import...

Homework Answers

Add Answer to: PYTHON import numpy as np import pandas as pd import matplotlib.pyplot as plt from sklearn.linear_model import...

Post as a guest

Earn Coins

Add Answer to:
PYTHON import numpy as np import pandas as pd import matplotlib.pyplot as plt from sklearn.linear_model import...