Workflow Element Store

  1. Annotation
  2. Binning / Discretization
  3. Handling Imbalanced Classes
  4. AutoEDA libraries
  5. Data Partitioning - Train, Validation, & Test
  6. Augmentation
  7. Time-Based Features
  8. Handling Noisy Data
  9. Interaction Features
  10. Dimensionality Reduction
  11. Handling Time-Series Data
  12. Auto-Preprocessing libraries
  13. Data Transformations
  14. Handling Categorical Data
  15. Feature Extraction from Images
  16. Domain-Specific Feature Engineering
  17. Dealing with Outliers
  18. Feature Selection
  19. Handling Missing Data
  20. Polynomial Features
  21. Textual Feature Extraction
  22. Data Scaling and Normalization
  1. Batch Size Selection
  2. Association Rules
  3. Forecasting Techniques
  4. Transfer Learning
  5. Recommendation Engine
  6. Regularization Techniques
  7. Learning Rate Scheduling
  8. Blackbox - Neural Network Models
  9. Evaluation Metrics
  10. External Validation
  11. Ensemble Techniques
  12. Network Analytics/ GeoSpatial Analytics
  13. Regularization
  14. Batch Normalization
  15. Cross-Validation
  16. Reinforcement Learning
  17. Early Stopping
  18. Clustering
  19. Natural Language Processing
  20. Regular Monitoring and Logging
  21. Multiclass Classification Techniques
  22. Binary Classification Techniques
  23. Performance Visualization
  24. Data Augmentation
  25. Model Interpretability
  26. Model Comparison
  27. Word Embeddings
  28. AutoML
  29. Transfer Learning
  30. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  31. Regression Analysis
  32. Weight Initialization
  33. Hyperparameter Tuning
  34. Cross-Validation
  1. Data Preprocessing pipeline models
  2. Apache Airflow
  3. code repository
  4. Datawarehouse
  5. Github
  6. Databases
  7. Evidently.ai
  8. Kafka Brokers
  9. model registry
  10. Github Actions
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)