Workflow Element Store

  1. Interaction Features
  2. Binning / Discretization
  3. Auto-Preprocessing libraries
  4. Handling Time-Series Data
  5. Dealing with Outliers
  6. Textual Feature Extraction
  7. Domain-Specific Feature Engineering
  8. AutoEDA libraries
  9. Annotation
  10. Handling Categorical Data
  11. Handling Missing Data
  12. Polynomial Features
  13. Data Scaling and Normalization
  14. Dimensionality Reduction
  15. Handling Noisy Data
  16. Data Partitioning - Train, Validation, & Test
  17. Feature Extraction from Images
  18. Feature Selection
  19. Handling Imbalanced Classes
  20. Data Transformations
  21. Augmentation
  22. Time-Based Features
  1. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  2. Model Comparison
  3. Cross-Validation
  4. Cross-Validation
  5. Network Analytics/ GeoSpatial Analytics
  6. External Validation
  7. Natural Language Processing
  8. Multiclass Classification Techniques
  9. Performance Visualization
  10. Reinforcement Learning
  11. Regular Monitoring and Logging
  12. Word Embeddings
  13. Learning Rate Scheduling
  14. Evaluation Metrics
  15. Regularization
  16. Regression Analysis
  17. Transfer Learning
  18. Recommendation Engine
  19. Data Augmentation
  20. Regularization Techniques
  21. Batch Size Selection
  22. Model Interpretability
  23. Binary Classification Techniques
  24. Association Rules
  25. Clustering
  26. Early Stopping
  27. AutoML
  28. Ensemble Techniques
  29. Forecasting Techniques
  30. Blackbox - Neural Network Models
  31. Weight Initialization
  32. Transfer Learning
  33. Batch Normalization
  34. Hyperparameter Tuning
  1. Github
  2. Apache Airflow
  3. Datawarehouse
  4. code repository
  5. Databases
  6. Github Actions
  7. model registry
  8. Data Preprocessing pipeline models
  9. Evidently.ai
  10. Kafka Brokers
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)