Workflow Element Store

  1. Dimensionality Reduction
  2. Handling Missing Data
  3. Textual Feature Extraction
  4. Handling Categorical Data
  5. Auto-Preprocessing Libraries
  6. Handling Noisy Data
  7. Handling Time-Series Data
  8. Domain-Specific Feature Engineering
  9. AutoEDA Libraries
  10. Augmentation
  11. Feature Extraction from Images
  12. Feature Selection
  13. Annotation
  14. Polynomial Features
  15. Handling Imbalanced Classes
  16. Time-Based Features
  17. Binning / Discretization
  18. Data Transformations
  19. Dealing with Outliers
  20. Data Partitioning - Train, Validation, & Test
  21. Data Scaling and Normalization
  22. Interaction Features
  1. Data Augmentation
  2. External Validation
  3. Natural Language Processing
  4. Learning Rate Scheduling
  5. Regularization
  6. Early Stopping
  7. Forecasting Techniques
  8. Blackbox - Neural Network Models
  9. Batch Size Selection
  10. Regular Monitoring and Logging
  11. AutoML
  12. Transfer Learning
  13. Batch Normalization
  14. Word Embeddings
  15. Evaluation Metrics
  16. Clustering
  17. Network Analytics / Geospatial Analytics
  18. Transfer Learning
  19. Reinforcement Learning
  20. Hyperparameter Tuning
  21. Performance Visualization
  22. Model Comparison
  23. Association Rules
  24. Regression Analysis
  25. Regularization Techniques
  26. Multiclass Classification Techniques
  27. Cross-Validation
  28. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  29. Cross-Validation
  30. Weight Initialization
  31. Ensemble Techniques
  32. Recommendation Engine
  33. Model Interpretability
  34. Binary Classification Techniques
  1. Code Repository
  2. GitHub Actions
  3. Data Warehouse
  4. Model Registry
  5. Evidently.ai
  6. Data Preprocessing Pipeline Models
  7. GitHub
  8. Apache Airflow
  9. Databases
  10. Kafka Brokers
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD Component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow Orchestration Component

Automated ML Workflow Pipeline

Monitoring Component

Model Serving Component

(Prediction on new batch or streaming data)