We are interested in detecting communities in a social media dataset. 1. How would you mine this problem? 2. Choose a social network and explain the kind of data cleaning you need. 3. What kind of dat...

Question

Question

We are interested in detecting communities in a social media dataset. 1. How would you mine this problem? 2. Choose a social network and explain the kind of data cleaning you need. 3. What kind of dat...

We are interested in detecting communities in a social media dataset.
1. How would you mine this problem?
2. Choose a social network and explain the kind of data cleaning you need.
3. What kind of data mining algorithm can you use?

engineering Computer-Science

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

1. Recently, social networking sites are offering a rich resource of heterogeneous data. The analysis of such data can lead to the discovery of unknown information and relations in these networks. The detection of communities including ‘similar’ nodes is a challenging topic in the analysis of social network data, and it has been widely studied in the social networking community in the context of underlying graph structure. Online social networks, in addition to having graph structures, include effective user information within networks. Using this information leads to enhance quality of community discovery. In this study, a method of community discovery is provided. Besides communication among nodes to improve the quality of the discovered communities, content information is used as well. This is a new approach based on frequent patterns and the actions of users on networks, particularly social networking sites where users carry out their preferred activities. The main contributions of proposed method are twofold: First, based on the interests and activities of users on networks, some small communities of similar users are discovered, and then by using social relations, the discovered communities are extended. The F-measure is used to evaluate the results of two real-world data set demonstrating that the proposed method principals to improve the community detection quality.

2. If you are analyzing your social media activities without cleaning your data first, you are wasting your time. Clean data is the key to getting your strategy right.

In social media, conversations aren't always that clean.

By unclean conversations we mean irrelevant ones. With the huge amount of existing social conversations, there naturally comes a lot of irrelevant ones too.

When speaking of companies and social data, irrelevant data refers to spam, ads, posts by the company itself or its employees, as well as posts not related to the brand. In other words, "noise". Also, not everything is posted by real humans. Some can be created by social media robots, so called bots.

Let's say that ACME brand has received 12 500 messages during the past seven days. Of all the messages, 77 % was noise. Which means, only 23 % of the whole conversation is relevant and created by real humans. Hence, you can ask yourself, would you rather make your decisions based on spam, as in the total of 12 500 messages, or on those 2875 messages actually created by real users?

Filtering out and removing the noise is indeed very time consuming, yet vital. In marketing, as well as in product development, it’s crucial to base your decisions on reliable and relevant data in order to understand what your customers really want.

3. The Balanced Link Density Label Propagation Algorithm is used .

The proposed method (BLDLP) substitutes random selection from famous LPA algorithm with a rational choice.

The BLDLP algorithm results are more stable

The BLDLP algorithm increases the original time efficiency

Add a comment

Answer 2

We are interested in detecting communities in a social media dataset. 1. How would you mine this problem? 2. Choose a social network and explain the kind of data cleaning you need. 3. What kind of dat...

Homework Answers

Add Answer to:
We are interested in detecting communities in a social media dataset. 1. How would you mine this problem? 2. Choose a social network and explain the kind of data cleaning you need. 3. What kind of dat...

Post as a guest

Earn Coins

In a 1-2 page paper explain the difference in how you would mine data based on the 3 categories; Prediction, Clustering,...

• Describe in detail a problem domain in the area of GIS. • Explain in detail how you would gather data needed to address this problem. Is there a public dataset available? Would you need to purchase...

3. (25 pts) Consider the data points: t y 0 1.20 1 1.16 2 2.34 3 6.08 ake a least squares fitting of these data using the model yü)- Be + Be-. Suppose we want to m (a) Explain how you would comput...

1. Write what would be an effective title for the passage. 2. What kind of language...

5. (20 pts) Suppose that we have a dataset {(yi, x, Tt2, X;3), i,1,... ,n} together with some general belief on the dat...

1. Describe in a couple of sentences what the data describes. 2. In one or two sentences, explain...

In this Module 2 Discussion, we shall discuss how to use R to obtain information by...

2. In what units of measure do we use for a zone of inhibition? 3. Given...

What kind of website have you built . e-commerce, lead generator, informational .Depending on which of...

3) Out of the following, name which kind of attack you carried out in part 1...

We are interested in detecting communities in a social media dataset. 1. How would you mine this problem? 2. Choose a social network and explain the kind of data cleaning you need. 3. What kind of dat...

Homework Answers

Add Answer to: We are interested in detecting communities in a social media dataset. 1. How would you mine this problem? 2. Choose a social network and explain the kind of data cleaning you need. 3. What kind of dat...

Post as a guest

Earn Coins

Add Answer to:
We are interested in detecting communities in a social media dataset. 1. How would you mine this problem? 2. Choose a social network and explain the kind of data cleaning you need. 3. What kind of dat...