i.
Regression is a modelling technique used when the target variable is continuous; that is, the value we wish to predict/ infer is a continuous value. Examples of this are numbers on a continuous scale (height in inches, rainfall in cm, income, price, etc.)
Classification is a modelling technique used when the target variable is discrete; that is, the value we wish to predict/ infer is a discrete/ class value. For example, predict if an image is a cat or a dog, predict if a transaction is fraud or not fraud.
ii.
There are many classification methods like:
- Logistic regression
- Decision trees
- Random forest
- Naive Bayes classifier
- Neural network
iii.
y1 is a combination of i1 and i2 combined with a combination of weights, and a bias unit added.
Thus, we can say that
h1= b1 + w1*i1 + w3*i2 and
h2= b2 + w2*i1 + w4*i2
and
output,
y1= b3+ w5*h1 + w6*h2
Neural networks are used for classification. The neural networks learn the required weights and biases, which combine the input in different combinations and learn features which can be used to distinguish various classes, thus becoming a powerful mathematical method to perform classification tasks.
iv.
Clustering is a method of unsupervised learning. That is, clustering is used when the target variable is unknown.
Exploratory analysis: Clustering is a powerful method to group similar data points together. This grouping might be difficult to perform manually, but various clustering methods easily calculate distances between data points (both numerical and categorical) and group data together easily, thus helping in exploring and visualizing the data better, and understanding patterns.
Fraud detection: Clustering is also often used to perform fraud detection. Fraud detection is a problem of class imbalance. For every 100 non fraud transactions, there are probably 2 or 3 fraud transactions. Clustering is a powerful tool to group similar non fraud transactions together, and the fraud transactions together, and calculating the distance of a new point from both the classes. This is a very effective method to perform fraud detection.
Customer segmentation: Clustering is also used in customer segmentation, and understanding customer behaviour. Clustering is often used to group similar customers together and performing clear segregations among customer types (high value customers, low value customers, etc).
Classification and regression are commonly used processes in business analytics. Briefly explain the difference between...
Explain using Matlab code but also why you used the linear system please 1 Quadrature Rule A quadrature rule is a way to approximate integrals numerically i.e. using a computer). Many such quadrature rules can be derived by solving a simple linear system. Set up a linear system and then use Matlab to find the coefficients wo, W1, W2, W3, W4, W5 such that | f(x)dx = wof(0) + wif(0.2) + w2f(0.4) + w3f(0.6) +w4f(0.8) + w5f (1) for each...
Read this article. Then write a 250 word response on two of the programs you like the most. Open source business intelligence software 1. BIRT BIRT is an open source BI program that CloudTweaks says is often viewed as the industry standard. BIRT boasts “over 12 million downloads and over 2.5 million developers across 157 countries.” Its users include heavyweights such as Cisco, S1, and IBM (which is also a BIRT sponsor). They also have maturity going for them, as...
Chapter overview 1. Reasons for international trade Resources reasons Economic reasons Other reasons 2. Difference between international trade and domestic trade More complex context More difficult and risky Higher management skills required 3. Basic concept s relating to international trade Visible trade & invisible trade Favorable trade & unfavorable trade General trade system & special trade system Volume of international trade & quantum of international trade Commodity composition of international trade Geographical composition of international trade Degree / ratio of...