Workflow Element Store

  1. Binning / Discretization
  2. Textual Feature Extraction
  3. Augmentation
  4. Handling Noisy Data
  5. Handling Missing Data
  6. Dealing with Outliers
  7. Domain-Specific Feature Engineering
  8. Handling Categorical Data
  9. Data Scaling and Normalization
  10. Interaction Features
  11. AutoEDA libraries
  12. Handling Time-Series Data
  13. Annotation
  14. Time-Based Features
  15. Handling Imbalanced Classes
  16. Feature Extraction from Images
  17. Dimensionality Reduction
  18. Feature Selection
  19. Auto-Preprocessing libraries
  20. Polynomial Features
  21. Data Partitioning - Train, Validation, & Test
  22. Data Transformations
  1. AutoML
  2. Regularization
  3. Performance Visualization
  4. Ensemble Techniques
  5. Reinforcement Learning
  6. Evaluation Metrics
  7. Regular Monitoring and Logging
  8. Cross-Validation
  9. Model Interpretability
  10. Multiclass Classification Techniques
  11. Batch Normalization
  12. Forecasting Techniques
  13. Batch Size Selection
  14. Transfer Learning
  15. Regularization Techniques
  16. Blackbox - Neural Network Models
  17. Network Analytics/ GeoSpatial Analytics
  18. Early Stopping
  19. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  20. Transfer Learning
  21. Data Augmentation
  22. Association Rules
  23. Learning Rate Scheduling
  24. Word Embeddings
  25. Model Comparison
  26. Hyperparameter Tuning
  27. Binary Classification Techniques
  28. Natural Language Processing
  29. External Validation
  30. Clustering
  31. Regression Analysis
  32. Weight Initialization
  33. Recommendation Engine
  34. Cross-Validation
  1. Github
  2. Datawarehouse
  3. Github Actions
  4. Databases
  5. model registry
  6. Apache Airflow
  7. Data Preprocessing pipeline models
  8. Evidently.ai
  9. Kafka Brokers
  10. code repository
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)