Data Science Digital Book

Learn About Data Science Concepts and Methodologies Through Articles

Mastering Data Science concepts and methodologies is crucial for success in the field. 360DigiTMG offers comprehensive courses that cover essential techniques and tools, empowering you to analyze data effectively and make data-driven decisions.

by Mr. Bharani Kumar

Forecasting

July 15, 2024

Time series data, that is, data gathered at equally spaced intervals of time, is a crucial kind of data.

by Mr. Bharani Kumar

Multi-Layered Perceptron (MLP) / Artificial Neural Network (ANN)

July 15, 2024

The mere existence of hidden layers will not capture nonlinear patterns; nonlinear activation functions are also needed.

by Mr. Bharani Kumar

Perceptron Algorithm

July 15, 2024

The goal of artificial intelligence is to simulate the human brain.

by Mr. Bharani Kumar

Deep Learning Primer

July 15, 2023

Artificial neural networks are used to simulate biological neural networks.

by Mr. Bharani Kumar

Support Vector Machine

July 22, 2023

SVMs can be used for almost every learning task, including classification and numerical prediction.

by Mr. Bharani Kumar

Logistic Regression

January 14, 2023

Logistic regression predicts the probability of the outcome class. The algorithm finds a linear relationship between the independent variables and a link function (the logit) of these probabilities.
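
The link-function idea can be sketched in a few lines. This is a minimal illustration, not the article's implementation: the function names, weights, and bias are hypothetical, and the logit is assumed as the link function.

```python
import math

def logit(p):
    """Link function: map a probability in (0, 1) to the log-odds."""
    return math.log(p / (1 - p))

def sigmoid(z):
    """Inverse of the logit: map a log-odds value back to a probability."""
    return 1 / (1 + math.exp(-z))

def predict_probability(features, weights, bias):
    """The log-odds are linear in the features; the sigmoid converts them to a probability."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)

# Hypothetical coefficients, chosen only for illustration
probability = predict_probability([0.0, 0.0], weights=[1.0, 2.0], bias=0.0)
```

With all features at zero and a zero bias the log-odds are zero, so the predicted probability is 0.5.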

by Mr. Bharani Kumar

Continuous Value Prediction

July 15, 2023

The Ordinary Least Squares technique is used to find the best-fit line. The best-fit line is the line that minimizes the sum of squared deviations between the data points and the line.
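
As a rough sketch of the idea, the slope and intercept of a single-variable best-fit line can be computed from the closed-form least-squares solution. The function name and the sample data are assumptions made for illustration.

```python
def fit_ols_line(xs, ys):
    """Return (slope, intercept) of the line minimizing the sum of squared vertical deviations."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel in the ratio)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return slope, intercept

# Illustrative data lying exactly on y = 2x + 1
slope, intercept = fit_ols_line([0, 1, 2, 3], [1, 3, 5, 7])
```

Because the sample points lie exactly on a line, the fit recovers a slope of 2 and an intercept of 1.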

by Mr. Bharani Kumar

Decision Tree

July 15, 2023

Decision Trees are nonparametric hierarchical models that work on a divide-and-conquer strategy. They are rule-based algorithms built on the principle of recursive partitioning.

by Mr. Bharani Kumar

Naive Bayes Algorithm

July 15, 2023

Naive Bayes is a machine learning method based on Bayes' theorem of conditional probability.

by Mr. Bharani Kumar

K-Nearest Neighbor

July 15, 2023

KNN is based on calculating the distance between points. Any distance measure can be used, such as the Euclidean distance discussed in previous sections.
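
A minimal sketch of distance-based classification follows, assuming Euclidean distance and a simple majority vote among the k nearest points. The function name, the sample points, and the labels are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Pair each training label with its Euclidean distance to the query, nearest first
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Two well-separated illustrative groups
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
labels = ["a", "a", "a", "b", "b", "b"]
prediction = knn_predict(points, labels, (1.2, 1.4), k=3)
```

Swapping `math.dist` for another distance function is the only change needed to try a different distance measure.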

by Mr. Bharani Kumar

Model Evaluation Techniques

July 15, 2023

The set of error functions below can be used to assess the model if the output variable 'Y' is continuous.

by Mr. Bharani Kumar

Data Mining Supervised Learning

July 15, 2023

Steps based on training and testing datasets: first, get the historical data needed for analysis, which is the output of data cleansing.

by Mr. Bharani Kumar

Text Mining

January 13, 2023

Text mining analyzes unstructured text data by generating structured data in key-value pair form.

by Mr. Bharani Kumar

Network Analysis

July 15, 2023

A distinct sort of data, known as network data or graph data, necessitates a different kind of analysis.

by Mr. Bharani Kumar

Recommender Systems

July 15, 2023

In the data used for the analysis, 'Users' are typically the rows and 'Items' are the columns.

by Mr. Bharani Kumar

Association Rules

July 15, 2023

The same concept underlies Relationship Mining, Market Basket Analysis, and Affinity Analysis: how two entities are connected to one another and whether there is any dependence between them.

by Mr. Bharani Kumar

Mathematical Foundations

July 15, 2023

Extracting a smaller set of input features from hundreds of variables is known as Dimensionality Reduction.

by Mr. Bharani Kumar

Hierarchical Clustering

July 15, 2023

Hierarchical clustering uses either an agglomerative technique (bottom-up hierarchy of clusters) or a divisive technique (top-down hierarchy of clusters).

by Mr. Bharani Kumar

Types of Clustering / Segmentation Algorithms

July 15, 2023

Similar records are grouped together (high intra-class similarity), and dissimilar records are assigned to different groups (low inter-class similarity).

by Mr. Bharani Kumar

Unsupervised Learning - Preliminaries

July 15, 2023

Standardize or normalize the variables before calculating the distance if the variables are on different scales or in different units.
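
To illustrate why scaling matters, the sketch below standardizes each column to zero mean and unit standard deviation (z-scores) before measuring Euclidean distance. The function name and the records are hypothetical, and every column is assumed to have a nonzero standard deviation.

```python
import math
import statistics

def standardize_columns(rows):
    """Rescale every column to zero mean and unit (population) standard deviation."""
    columns = list(zip(*rows))
    means = [statistics.mean(col) for col in columns]
    stdevs = [statistics.pstdev(col) for col in columns]  # assumed nonzero
    return [
        tuple((value - m) / s for value, m, s in zip(row, means, stdevs))
        for row in rows
    ]

# Hypothetical records: (age in years, income in currency units)
records = [(25, 30000), (30, 32000), (60, 31000)]
scaled = standardize_columns(records)

# On the raw values the income column dominates the distance
raw_distance = math.dist(records[0], records[1])
# After standardizing, both columns contribute on a comparable scale
scaled_distance = math.dist(scaled[0], scaled[1])
```

Min-max normalization to the range 0 to 1 is a common alternative; which one to use depends on the data and the algorithm.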

by Mr. Bharani Kumar

CRISP - DM Model Building Using Data Mining

July 15, 2023

If the outcome variable 'Y' in the historical data is known, then supervised learning tasks are applied to the historical data. Predictive modelling and machine learning are other names for supervised learning.

by Mr. Bharani Kumar

Feature Engineering

July 15, 2024

Feature Extraction and Feature Engineering are other names for attribute generation. Try to use domain expertise to create more insightful derived variables from the provided variables.