Workflow Element Store

  1. Data Scaling and Normalization
  2. Dealing with Outliers
  3. Dimensionality Reduction
  4. Domain-Specific Feature Engineering
  5. Handling Imbalanced Classes
  6. Data Partitioning - Train, Validation, & Test
  7. Textual Feature Extraction
  8. Auto-Preprocessing libraries
  9. Feature Extraction from Images
  10. Handling Missing Data
  11. Binning / Discretization
  12. Interaction Features
  13. AutoEDA libraries
  14. Data Transformations
  15. Handling Time-Series Data
  16. Time-Based Features
  17. Polynomial Features
  18. Augmentation
  19. Feature Selection
  20. Annotation
  21. Handling Categorical Data
  22. Handling Noisy Data
  1. Network Analytics/ GeoSpatial Analytics
  2. Natural Language Processing
  3. Reinforcement Learning
  4. AutoML
  5. Cross-Validation
  6. Batch Normalization
  7. Early Stopping
  8. Data Augmentation
  9. Model Interpretability
  10. Word Embeddings
  11. Regularization Techniques
  12. Recommendation Engine
  13. Transfer Learning
  14. Forecasting Techniques
  15. Weight Initialization
  16. Regular Monitoring and Logging
  17. Regularization
  18. Transfer Learning
  19. Clustering
  20. Association Rules
  21. External Validation
  22. Cross-Validation
  23. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  24. Regression Analysis
  25. Evaluation Metrics
  26. Binary Classification Techniques
  27. Batch Size Selection
  28. Blackbox - Neural Network Models
  29. Multiclass Classification Techniques
  30. Ensemble Techniques
  31. Model Comparison
  32. Learning Rate Scheduling
  33. Performance Visualization
  34. Hyperparameter Tuning
  1. model registry
  2. Data Preprocessing pipeline models
  3. Evidently.ai
  4. Github
  5. Databases
  6. Github Actions
  7. Apache Airflow
  8. code repository
  9. Datawarehouse
  10. Kafka Brokers
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)