Question


In an effort to inform political leaders and economists discussing the deregulation of electric and gas utilities, data on eight numerical variables from utility companies have been grouped using hierarchical clustering based on Euclidean distance as the similarity measure and complete linkage as the clustering method.

(a) Based on the following dendrogram, what is the most appropriate number of clusters to organize these utility companies?

[Image: dendrogram of the utility companies, not reproduced here]

____ clusters are appropriate based on complete linkage.


(b) Using the following data on Observations 10, 13, 4, and 20, confirm that the complete linkage distance between the cluster containing {10, 13} and the cluster containing {4, 20} is 2.577 units as displayed in the dendrogram.

[Image: table of the eight variable values for Observations 10, 13, 4, and 20, not reproduced here]

If required, round your answers to three decimal places. Do not round intermediate calculations.



  • Distance between Observation 10 and Observation 4:

  • Distance between Observation 10 and Observation 20:

  • Distance between Observation 13 and Observation 4:

  • Distance between Observation 13 and Observation 20:


Answer #1

(a) In the given dendrogram, observations {7, 12, 15, 17, 8, 16, 11} are very closely spaced and should therefore be grouped together. Cutting the tree into 4 clusters works well on the left side, but the rightmost cluster would then contain only 3 points, which seems unreasonable. Hence, 2 clusters are appropriate based on complete linkage.

(b) The Euclidean distance between observation 10 and 4 is calculated as follows:

\(\begin{aligned} d_{10,4} &=\sqrt{(0.032+0.510)^{2}+(0.741-0.207)^{2}+(0.7+0.004)^{2}+(-0.892+0.219)^{2} +(-0.173+0.943)^{2}+(-0.693+0.702)^{2}+(1.62-1.328)^{2}+(-0.863+0.724)^{2}} \\ &=1.4916 \\ d_{10,4} & \approx 1.492 \end{aligned}\)

The Euclidean distance between observation 10 and 20 is calculated as follows:

$$ \begin{aligned} d_{10,20} &=\sqrt{(0.032-0.466)^{2}+(0.741-0.474)^{2}+(0.7+0.490)^{2}+(-0.892-0.655)^{2} +(-0.173-0.083)^{2}+(-0.693+0.458)^{2}+(1.62-1.733)^{2}+(-0.863+0.721)^{2}} \\ &=2.0549 \\ d_{10,20} & \approx 2.055 \end{aligned} $$

The Euclidean distance between observation 13 and 4 is calculated as follows:

$$ \begin{aligned} d_{13,4} &=\sqrt{(0.195+0.510)^{2}+(0.875-0.207)^{2}+(0.748+0.004)^{2}+(-0.735+0.219)^{2} +(1.013+0.943)^{2}+(-0.489+0.702)^{2}+(2.275-1.328)^{2}+(-1.035+0.724)^{2}} \\ &=2.5768 \\ d_{13,4} & \approx 2.577 \end{aligned} $$

The Euclidean distance between observation 13 and 20 is calculated as follows:

$$ \begin{aligned} d_{13,20} &=\sqrt{(0.195-0.466)^{2}+(0.875-0.474)^{2}+(0.748+0.490)^{2}+(-0.735-0.655)^{2} +(1.013-0.083)^{2}+(-0.489+0.458)^{2}+(2.275-1.733)^{2}+(-1.035+0.721)^{2}} \\ &=2.2264 \\ d_{13,20} & \approx 2.226 \end{aligned} $$

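Hence, the correct answers are as follows:

Distance between Observation 10 and Observation 4 is 1.492

Distance between Observation 10 and Observation 20 is 2.055

Distance between Observation 13 and Observation 4 is 2.577

Distance between Observation 13 and Observation 20 is 2.226

Complete linkage takes the largest of these four pairwise distances, which is 2.577 (between Observations 13 and 4). This matches the value displayed in the dendrogram.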
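The arithmetic in part (b) can be double-checked with a short script. This is only a sketch: the coordinates are read off the differences written out in the equations above (the original data table is not reproduced here), and the names `observations` and `complete_linkage` are made up for illustration.

```python
import math
from itertools import product

# Values of the eight variables for each observation, inferred from the
# terms in the distance formulas above (an assumption, since the original
# data table is only available as an image).
observations = {
    10: (0.032, 0.741, 0.700, -0.892, -0.173, -0.693, 1.620, -0.863),
    13: (0.195, 0.875, 0.748, -0.735, 1.013, -0.489, 2.275, -1.035),
    4: (-0.510, 0.207, -0.004, -0.219, -0.943, -0.702, 1.328, -0.724),
    20: (0.466, 0.474, -0.490, 0.655, 0.083, -0.458, 1.733, -0.721),
}


def complete_linkage(cluster_a, cluster_b, points):
    """Return the largest pairwise Euclidean distance between two clusters."""
    return max(
        math.dist(points[a], points[b])
        for a, b in product(cluster_a, cluster_b)
    )


# All four cross-cluster Euclidean distances between {10, 13} and {4, 20}.
pairwise = {
    (a, b): math.dist(observations[a], observations[b])
    for a, b in product((10, 13), (4, 20))
}

for (a, b), distance in pairwise.items():
    print(f"d({a}, {b}) = {distance:.3f}")

linkage = complete_linkage((10, 13), (4, 20), observations)
print(f"complete linkage distance = {linkage:.3f}")
```

Rounded to three decimal places, the four distances and their maximum agree with the values in the answer above. `math.dist` requires Python 3.8 or later.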


answered by: Lummoleft