Question

SEMMA and CRISP-DM are two methodologies used in the domain of data mining. In your research...

SEMMA and CRISP-DM are two methodologies used in the domain of data mining. In your research on these methodologies you have probably noted some similarities in terms of phases represented in both approaches. Do you think that there could be scenarios where a hybrid methodology that uses both these approaches would be beneficial? Why? Describe the scenario and share your opinion/justification.

0 0
Add a comment Improve this question Transcribed image text
Answer #1

Data Mining refers to the process of discovering the patterns and trends in data in order to predict future results. There are two most widely recognized methodologies in the field of the Data Mining and they are SEMMA (Sample, Explore, Modify, Model and Assess) and CRISP-DM ( Cross Industry Standard Process for Data Mining).

SEMMA was proposed by the SAS institute which is one of the most popular companies that develop statistical software applications with their software package - Enterprise Miner. This methodology starts with the collection and sampling of the data followed by exploring the data in order to identify the trends and anomalies in the data. This is carried out in order to gain some information about the dataset. In the following step, the data is modified to create, select and transform the variables for the study. In the next step, the data is used to create a model that can be used for predictive analysis. In the end, the methodology involves evaluating the performance of the model. Although this methodology covers the statistical aspects of any data mining applications it does not take care of the implementation, design and analysis phases. It is application dependent and works well for the SAS Enterprise Miner software. This is because this methodology was specifically developed for the SAS Enterprise Miner software.

CRISP-DM is one of the most widely used analytics model. It was conceived in 1996. It tries to break down the process of data mining into six phases. These six phases include Business Understanding, Data Understanding, Data Preparation, Modelling, Evaluation, Deployment. The main advantage of CRISP-DM is that it takes care of the business understanding part of any data mining project very well. This is significantly important as the SEMMA does not take the aspirations of the stakeholder organization into account. It is also an industry, tool, and application-independent methodology.

There might be a scenario when a research problem might not only require good data analytics processing but also has to care for the aspirations of the stakeholder organization. In such a case, it would be better to use a hybrid combination of both SEMMA and CRISP-DM. The SEMMA will take care of the analytics part such as Data Collection, Exploratory Data Analysis, Data Modeling, and Model Evaluation while the CRISP-DM might take care of the implementation and developmental aspects of the application. The role of CRISP-DM is very crucial here as the development of a Data Mining application is a cyclic process and the model has to train after certain intervals of the time in order to ensure the consistency of the Data Mining application, therefore the CRISP-DM provides the implementational superiority to the Data Mining application. Since the CRISP-DM is not software specific and hence it can be used along SEMMA to develop and deploy a Data Mining application to the client.

Here's the solution to your question. Thanks for asking and happy learning!!!

Add a comment
Know the answer?
Add Answer to:
SEMMA and CRISP-DM are two methodologies used in the domain of data mining. In your research...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • Read the case: Netflix Inc.: The Second Act - Moving into Streaming and complete your case...

    Read the case: Netflix Inc.: The Second Act - Moving into Streaming and complete your case analysis. Discuss the following: 1) briefly summarize the key marketing strategy issues in the case that are still relevant TODAY in addition to contemporary issues you find via research; 2) make thorough recommendations on how the issues should be handled; 3) provide a justification for the recommendations. Case write-ups should be 3-5 pages, double spaced, 12 font size in Times New Roman. The case...

  • Hello! Could you please write your own four paragraph (5-6 sentences per paragraph) take away or...

    Hello! Could you please write your own four paragraph (5-6 sentences per paragraph) take away or reflection of the below information? Please complete in 24 hours if possible. Thank you! RIS BOHNET THINKS firms are wasting their money on diversity training. The problem is, most programs just don’t work. Rather than run more workshops or try to eradicate the biases that cause discrimination, she says, companies need to redesign their processes to prevent biased choices in the first place. Bohnet...

  • Using the book, write another paragraph or two: write 170 words: Q: Compare the assumptions of...

    Using the book, write another paragraph or two: write 170 words: Q: Compare the assumptions of physician-centered and collaborative communication. How is the caregiver’s role different in each model? How is the patient’s role different? Answer: Physical-centered communication involves the specialists taking control of the conversation. They decide on the topics of discussion and when to end the process. The patient responds to the issues raised by the caregiver and acts accordingly. On the other hand, Collaborative communication involves a...

  • Heavy Equipment and Machinery Inc. Trial Balance At December 31, 2019

    What is the answer to these tables? here is all the information that had been given to me and my answers to the question that I think needs to be answered to complete the two tablesYou have been hired as a Financial Consultant by Heavy Equipment and Machinery Inc. (HEMI).  HEMI is a private corporation that has finished its first year of operations. HEMI's owners plan to list the business on the Toronto Stock Exchange  (TSE) in the next 5 years; accordingly,...

  • Discussion questions 1. What is the link between internal marketing and service quality in the ai...

    Discussion questions 1. What is the link between internal marketing and service quality in the airline industry? 2. What internal marketing programmes could British Airways put into place to avoid further internal unrest? What potential is there to extend auch programmes to external partners? 3. What challenges may BA face in implementing an internal marketing programme to deliver value to its customers? (1981)ǐn the context ofbank marketing ths theme has bon pururd by other, nashri oriented towards the identification of...

  • How can we assess whether a project is a success or a failure? This case presents...

    How can we assess whether a project is a success or a failure? This case presents two phases of a large business transformation project involving the implementation of an ERP system with the aim of creating an integrated company. The case illustrates some of the challenges associated with integration. It also presents the obstacles facing companies that undertake projects involving large information technology projects. Bombardier and Its Environment Joseph-Armand Bombardier was 15 years old when he built his first snowmobile...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT