Workflow Element Store

  1. Dimensionality Reduction
  2. Augmentation
  3. Domain-Specific Feature Engineering
  4. Feature Selection
  5. Time-Based Features
  6. Data Transformations
  7. Handling Time-Series Data
  8. Dealing with Outliers
  9. Handling Noisy Data
  10. Handling Missing Data
  11. Feature Extraction from Images
  12. Textual Feature Extraction
  13. Data Scaling and Normalization
  14. AutoEDA libraries
  15. Auto-Preprocessing libraries
  16. Binning / Discretization
  17. Polynomial Features
  18. Handling Categorical Data
  19. Interaction Features
  20. Handling Imbalanced Classes
  21. Annotation
  22. Data Partitioning - Train, Validation, & Test
  1. Batch Normalization
  2. Data Augmentation
  3. Model Comparison
  4. External Validation
  5. Reinforcement Learning
  6. Regression Analysis
  7. Weight Initialization
  8. Blackbox - Neural Network Models
  9. Recommendation Engine
  10. Regularization
  11. Association Rules
  12. Transfer Learning
  13. Network Analytics/ GeoSpatial Analytics
  14. Model Interpretability
  15. Natural Language Processing
  16. Performance Visualization
  17. Ensemble Techniques
  18. Transfer Learning
  19. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  20. Early Stopping
  21. Regular Monitoring and Logging
  22. Forecasting Techniques
  23. AutoML
  24. Learning Rate Scheduling
  25. Clustering
  26. Regularization Techniques
  27. Multiclass Classification Techniques
  28. Cross-Validation
  29. Word Embeddings
  30. Evaluation Metrics
  31. Hyperparameter Tuning
  32. Binary Classification Techniques
  33. Batch Size Selection
  34. Cross-Validation
  1. code repository
  2. Github Actions
  3. Databases
  4. Kafka Brokers
  5. Apache Airflow
  6. Datawarehouse
  7. Data Preprocessing pipeline models
  8. Github
  9. Evidently.ai
  10. model registry
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)