Question
Discuss three (3) issues to consider during data cleaning. Give an example of a real world problem for each issue.?
Data Cleansing or Scrubbing is the process of detecting & removing inconsistencies & errors from data to improve the quality of data. The need for data cleansing increases significantly when multiple data sources are integrated. This process of making data accurate and consistent is riddled with many problems, few of which are mentioned below:
Misspellings:
Misspellings occur mostly due to typing error. The wrong spelling can be detected and corrected for common words and grammatical errors, however, as database constrain huge amount of data that is unique, it is hard to detect spelling mistake at input-level. Further, Spelling mistakes in data such as names, addresses are always difficult to identify and correct.
Lexical Errors:
Lexical errors occur in data due to name discrepancies between the structure of the data items and the specified format. Example, a particular database records attribute for name, age, sex and height. When an individual does not enter an intermediate value say (age) the data for following attributes changes field. In above case, when individual does not enter value for age, value for sex, say male is read as age and value of height is read as sex.
Misfielded Value:
Misfielded value problem occurs when the values entered are correct as far format is concerned but does not belong to the field. Example in field of city, value recorded is Germany.
Domain Format Errors:
Domain format errors occur when the value for a particular attribute is correct but do not comply with format of domain. Example, a particular NAME database requires first name and surname to be separated with comma but the input is without comma. In this case while the input may be correct but it does not comply with domain format.
Irregularities:
Irregularities deal with non-uniform use of units or values. Example while doing entry of salary of employee, the salary is mentioned using different currencies. This kind of data requires subjective interpretation and can often result in wrong results.
Missing Values:
Missing values occur as a result of omissions that happen while collecting the data. They signify unavailability of values during process of data entry. Both dummy values and null values are included in missing values. For example, 000-0000 and 999-9999 in the telephone number field.
Question Discuss three (3) issues to consider during data cleaning. Give an example of a real...
Question 1 Discuss three (3) issues to consider during data integration. Give an example of a real world problem for each issue.
Give an example of a real world application of linear regression. Discuss how this analytical tool provides important information for decision making in this example.
Give a real-world example of where ratio or proportions are used. 2. Give a real-world example of direct variation. 3. When you go out to dinner at a restaurant, how does your party determine the tip to leave the server?
Please Answer Both Questions a.Define, discuss, and give real world examples of the main types of social-political-economic systems. b.Define, discuss, and give real world examples of the 3 significant questions that each economic system must answer regardless of social-political-economic system.
Give three example of issues that have been affected by selective incorporation?
Define each of the following recovery technique and give a real-world example of where the technique could be used effectively. Reprocessing transactions
1.How Do You Measure The Efficiency Of Resource Allocation? Give An Example For A Real-world ... Your question has been answered Let us know if you got a helpful answer. Rate this answer Question: 1.How do you measure the efficiency of resource allocation? Give an example for a real-world appl... 1.How do you measure the efficiency of resource allocation? Give an example for a real-world application of measuring the efficiency of resource allocation. 2.Suppose the geographical tension in the Arab...
Give an example of data that could be obtained to make a prediction. Discuss lurking variables and why the correlation may not imply causation. Correlation and Regression is typically most students’ favorite area in statistics because it gives the ability to make predictions. Give an example of data that could be obtained to make a prediction. Discuss lurking variables and why the correlation may not imply causation.
Assume you are the chief accountant making a presentation during the stockholders’ annual meeting for your corporation. Provide a brief explanation to stockholders on each of the following questions: Shares In what ways can shares be “preferred”? In which ways are they similar and different from common shares? Give real-world examples. How does the book value of shares of stock differ from the market value of shares of stock? Use a real-world example in your answer. Dividends Discuss at least...
Problem 3. Prove or give a counter example 1. If an converges to a real limit then limn700 (m)" = 0. 2. If an is a positive sequence satisfying limn+ ()" = 0 then it con- verges.