Workflow Element Store

  1. Augmentation
  2. Dealing with Outliers
  3. Handling Noisy Data
  4. Time-Based Features
  5. Dimensionality Reduction
  6. Handling Imbalanced Classes
  7. Data Partitioning - Train, Validation, & Test
  8. Interaction Features
  9. Binning / Discretization
  10. Handling Categorical Data
  11. Data Transformations
  12. Annotation
  13. Feature Selection
  14. Handling Missing Data
  15. Handling Time-Series Data
  16. Domain-Specific Feature Engineering
  17. Data Scaling and Normalization
  18. Polynomial Features
  19. Textual Feature Extraction
  20. Feature Extraction from Images
  21. Auto-Preprocessing Libraries
  22. AutoEDA Libraries
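
Item 7 in the list above (Data Partitioning - Train, Validation, & Test) can be illustrated with a minimal sketch. The function name `train_val_test_split`, the 70/15/15 proportions, and the fixed seed are assumptions made for illustration; the source does not specify how the partitioning is done.

```python
import random


def train_val_test_split(rows, val_frac=0.15, test_frac=0.15, seed=0):
    """Shuffle rows and cut them into train, validation, and test lists.

    Hypothetical helper: the fractions and seed are illustrative defaults,
    not values taken from the workflow described here.
    """
    if val_frac < 0 or test_frac < 0 or val_frac + test_frac >= 1:
        raise ValueError("val_frac and test_frac must be >= 0 and sum to < 1")

    shuffled = list(rows)
    # A seeded generator keeps the partition reproducible across runs.
    random.Random(seed).shuffle(shuffled)

    n_test = round(len(shuffled) * test_frac)
    n_val = round(len(shuffled) * val_frac)

    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


train, val, test = train_val_test_split(range(100))
print(len(train), len(val), len(test))
```

A plain random shuffle like this suits independent rows only; time-series data (item 15) would normally be split by time order instead, so that no future observations leak into training.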

  1. Transfer Learning
  2. Learning Rate Scheduling
  3. Recommendation Engine
  4. Multiclass Classification Techniques
  5. Batch Normalization
  6. Regular Monitoring and Logging
  7. Forecasting Techniques
  8. Cross-Validation
  9. Regularization Techniques
  10. Early Stopping
  11. Hyperparameter Tuning
  12. Transfer Learning
  13. External Validation
  14. Natural Language Processing
  15. Evaluation Metrics
  16. Performance Visualization
  17. Clustering
  18. Regularization
  19. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  20. Binary Classification Techniques
  21. Ensemble Techniques
  22. Weight Initialization
  23. AutoML
  24. Association Rules
  25. Cross-Validation
  26. Reinforcement Learning
  27. Data Augmentation
  28. Black-Box - Neural Network Models
  29. Model Comparison
  30. Word Embeddings
  31. Regression Analysis
  32. Model Interpretability
  33. Network Analytics / Geospatial Analytics
  34. Batch Size Selection

  1. Model Registry
  2. Kafka Brokers
  3. Data Warehouse
  4. Data Preprocessing Pipeline Models
  5. Apache Airflow
  6. Code Repository
  7. Databases
  8. GitHub Actions
  9. GitHub
  10. Evidently.ai

ML Workflow - Architecture
  • Element belongs to the model
  • Element does not belong to the model

Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Orchestration Component

Artifact Store

CI/CD Component

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)