Workflow Element Store

  1. Data Partitioning - Train, Validation, & Test
  2. Handling Missing Data
  3. Domain-Specific Feature Engineering
  4. AutoEDA libraries
  5. Augmentation
  6. Annotation
  7. Data Scaling and Normalization
  8. Auto-Preprocessing libraries
  9. Binning / Discretization
  10. Data Transformations
  11. Feature Extraction from Images
  12. Dealing with Outliers
  13. Feature Selection
  14. Interaction Features
  15. Time-Based Features
  16. Handling Categorical Data
  17. Handling Imbalanced Classes
  18. Polynomial Features
  19. Textual Feature Extraction
  20. Handling Noisy Data
  21. Handling Time-Series Data
  22. Dimensionality Reduction
  1. Transfer Learning
  2. Multiclass Classification Techniques
  3. Cross-Validation
  4. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  5. Reinforcement Learning
  6. Batch Size Selection
  7. Word Embeddings
  8. Recommendation Engine
  9. Forecasting Techniques
  10. Regression Analysis
  11. Early Stopping
  12. Cross-Validation
  13. Performance Visualization
  14. AutoML
  15. Data Augmentation
  16. Batch Normalization
  17. Ensemble Techniques
  18. Black-Box Neural Network Models
  19. Evaluation Metrics
  20. Clustering
  21. Binary Classification Techniques
  22. Hyperparameter Tuning
  23. Regularization Techniques
  24. Regularization
  25. Transfer Learning
  26. Natural Language Processing
  27. Learning Rate Scheduling
  28. Model Comparison
  29. External Validation
  30. Regular Monitoring and Logging
  31. Model Interpretability
  32. Weight Initialization
  33. Network Analytics / Geospatial Analytics
  34. Association Rules
  1. Kafka Brokers
  2. Evidently.ai
  3. GitHub
  4. Data Warehouse
  5. Databases
  6. GitHub Actions
  7. Model Registry
  8. Data Preprocessing Pipeline Models
  9. Apache Airflow
  10. Code Repository
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD Component

Continuous Integration / Continuous Delivery

Continuous Deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow Orchestration Component

Automation ML Workflow Pipeline

Monitoring Component

Model Serving Component

(Prediction on new batch or streaming data)
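
The two serving modes named in the last label, batch and streaming, can be illustrated with a minimal sketch. Everything here is an assumption for illustration only: `toy_model` stands in for whatever model is pulled from the model registry, and `predict_batch` and `predict_stream` are hypothetical helper names, not part of any component in the diagram.

```python
from typing import Callable, Iterable, Iterator, List, Sequence

Features = Sequence[float]
# Hypothetical model interface: any callable mapping a feature row to a score.
Model = Callable[[Features], float]


def toy_model(features: Features) -> float:
    """Stand-in for a trained model (assumption): returns the mean of the features."""
    return sum(features) / len(features)


def predict_batch(model: Model, rows: Iterable[Features]) -> List[float]:
    """Batch serving: score a whole collection of rows and return all results at once."""
    return [model(row) for row in rows]


def predict_stream(model: Model, events: Iterable[Features]) -> Iterator[float]:
    """Streaming serving: score each event lazily, one at a time, as it arrives."""
    for event in events:
        yield model(event)


if __name__ == "__main__":
    rows = [[1.0, 3.0], [2.0, 4.0]]

    # Batch mode: the full result list is produced in one call.
    print(predict_batch(toy_model, rows))

    # Streaming mode: results are produced only as the consumer iterates.
    for score in predict_stream(toy_model, iter(rows)):
        print(score)
```

In a real deployment the streaming source might be a consumer reading from the Kafka brokers listed in the element store, and the batch source a table in the data warehouse; the sketch only shows the difference in how predictions are produced.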