Data Science Digital Book

Forecasting
July 15, 2023

Time series data, gathered at equally spaced intervals of time, is a crucial input for forecasting.

Multi-Layer Perceptron (MLP) / Artificial Neural Network (ANN)
July 15, 2023

Hidden layers alone will not capture nonlinear patterns; nonlinear activation functions are also needed.

Perceptron Algorithm
July 15, 2023

The goal of artificial intelligence is to simulate the human brain.

Deep Learning Primer
July 15, 2023

Artificial neural networks are used to simulate biological neural networks.

Support Vector Machine
July 22, 2023

SVMs can be used for almost every learning task, including classification and numerical prediction.

Logistic Regression
January 14, 2023

Logistic regression predicts the probability of the outcome class. The algorithm finds the linear relationship between the independent variables and a link function (the logit) of these probabilities.

Continuous Value Prediction
July 15, 2023

The Ordinary Least Squares technique finds the best-fit line: the line with the minimum sum of squared deviations from the data points.
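
As a rough illustration of the least squares idea, the sketch below fits a line to a handful of points using the closed-form slope and intercept. The function name `fit_line` and the sample data are made up for this example; they are not taken from the linked post.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```

With noisy data the fitted line no longer passes through every point, but it still minimises the sum of squared vertical deviations.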

Decision Tree
July 15, 2023

Decision trees are nonparametric hierarchical models that work on a divide-and-conquer strategy: a rule-based algorithm built on the principle of recursive partitioning.

Naive Bayes Algorithm
July 15, 2023

Naive Bayes is a machine learning method based on Bayes' theorem of conditional probability.

K-Nearest Neighbour
July 15, 2023

KNN is based on calculating the distance between points. Any distance measure can be used, such as the Euclidean distance discussed in previous sections.
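
A minimal sketch of the distance-based idea, assuming Euclidean distance and a simple majority vote among the k nearest points. The helper names `euclidean` and `knn_predict` and the toy training set are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def knn_predict(train, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest neighbours."""
    nearest = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical labelled points: (features, label).
train = [
    ((1.0, 1.0), "A"),
    ((1.5, 2.0), "A"),
    ((2.0, 1.0), "A"),
    ((6.0, 6.0), "B"),
    ((7.0, 7.0), "B"),
]
prediction = knn_predict(train, (1.2, 1.1), k=3)
print(prediction)
```

Swapping `euclidean` for another distance function (Manhattan, for example) changes which neighbours count as nearest without changing the voting step.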

Model Evaluation Techniques
July 15, 2023

The set of error functions below can be used to assess the model if the output variable 'Y' is continuous.

Data Mining Supervised Learning
July 15, 2023

Steps based on training and testing datasets: get the historical data needed for analysis, which is the output of data cleansing.

Text Mining
January 13, 2023

Text mining analyzes unstructured text data by generating structured data in key-value pair form.

Network Analysis
July 15, 2023

A distinct sort of data, known as network data or graph data, necessitates a different kind of analysis.

Recommender Systems
July 15, 2023

'Users' are typically the rows in the data utilised for the analysis, and 'Items' will be the columns.

Association Rules
July 15, 2023

The same concept underlies Relationship Mining, Market Basket Analysis, and Affinity Analysis: how are two entities connected to one another, and is there any dependence between them?

Mathematical Foundations
July 15, 2023

Feature extraction of input variables from hundreds of variables is known as Dimensionality Reduction.

Hierarchical Clustering
July 15, 2023

The Agglomerative technique (bottom-up hierarchy of clusters) and the Divisive technique (top-down hierarchy of clusters) are the two approaches to hierarchical clustering.

Types of Clustering / Segmentation Algorithms
July 15, 2023

Similar records are grouped together (high intra-class similarity), and dissimilar records are assigned to different groups (low inter-class similarity).

Unsupervised Preliminaries
July 15, 2023

Standardize or normalize the variables before calculating the distance if the variables are on different scales or in different units.
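
To show why this matters, the sketch below applies z-score standardization to two made-up variables in very different units. The function name `standardize`, the sample values, and the choice of the population standard deviation are assumptions made for this example.

```python
import statistics


def standardize(values):
    """Rescale values to mean 0 and unit (population) standard deviation."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)  # assumes the values are not all identical
    return [(v - mean) / sd for v in values]


# Hypothetical variables measured in different units.
heights_cm = [150.0, 160.0, 170.0, 180.0]
incomes = [20000.0, 40000.0, 60000.0, 80000.0]

z_heights = standardize(heights_cm)
z_incomes = standardize(incomes)
print(z_heights)
print(z_incomes)
```

After standardization both variables sit on a comparable scale, so neither dominates a distance calculation simply because its raw numbers are larger.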

CRISP-DM Model Building Using Data Mining
July 15, 2023

If the outcome variable 'Y' in the historical data is known, then supervised learning tasks are applied to the historical data. Predictive modelling and machine learning are other names for supervised learning.

Feature Engineering
July 15, 2023

Feature Extraction and Feature Engineering are other names for attribute generation. Try to use domain expertise to create more insightful derived variables from the provided variables.

Data Quality Analysis
July 15, 2023

The goal of this stage is to locate any potential data mistakes, flaws, or problems.

Graphical Representations
July 15, 2023

Univariate analysis is the analysis of a single variable.

CRISP-DM Data Cleansing / Data Preparation
July 15, 2023

Other names for data cleaning include data preparation, data organisation, munging, and data wrangling.

CRISP-DM Data Collection
July 15, 2023

CRISP-DM stands for Cross Industry Standard Process for Data Mining. Articulate the business problem by understanding the client/customer requirements.

Ingredients of AI
July 15, 2023

Definition of Artificial Intelligence, Data Science, Data Mining, Machine Learning, Deep Learning, Reinforcement Learning (RL)

You may also like...

February 19, 2024

In the current landscape of Large Language Models (LLMs), managing text efficiently has become paramount. This blog introduces Chroma DB, an open-source tool specifically designed to handle text documents, convert text to embeddings, and execute similarity searches with ease.

February 19, 2024

In a world where time is the currency and data is the kingdom, Annoy emerges as the undisputed champion of high-speed nearest neighbor searches.

February 19, 2024

In the age of artificial intelligence, where data reigns supreme, finding efficient ways to store and manage information is more crucial than ever.
