Workflow Element Store

  1. AutoEDA libraries
  2. Binning / Discretization
  3. Data Scaling and Normalization
  4. Handling Missing Data
  5. Annotation
  6. Dimensionality Reduction
  7. Handling Imbalanced Classes
  8. Interaction Features
  9. Data Transformations
  10. Feature Selection
  11. Feature Extraction from Images
  12. Textual Feature Extraction
  13. Time-Based Features
  14. Domain-Specific Feature Engineering
  15. Handling Categorical Data
  16. Handling Noisy Data
  17. Polynomial Features
  18. Data Partitioning - Train, Validation, & Test
  19. Auto-Preprocessing libraries
  20. Dealing with Outliers
  21. Handling Time-Series Data
  22. Augmentation
  1. Blackbox - Neural Network Models
  2. Regression Analysis
  3. Model Interpretability
  4. Regularization Techniques
  5. Model Comparison
  6. Natural Language Processing
  7. Word Embeddings
  8. Reinforcement Learning
  9. External Validation
  10. Weight Initialization
  11. Data Augmentation
  12. Performance Visualization
  13. Transfer Learning
  14. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  15. Evaluation Metrics
  16. Transfer Learning
  17. Cross-Validation
  18. Binary Classification Techniques
  19. Learning Rate Scheduling
  20. Clustering
  21. Early Stopping
  22. Recommendation Engine
  23. Ensemble Techniques
  24. Batch Size Selection
  25. Batch Normalization
  26. Forecasting Techniques
  27. Network Analytics/ GeoSpatial Analytics
  28. Regularization
  29. Regular Monitoring and Logging
  30. Multiclass Classification Techniques
  31. Association Rules
  32. Cross-Validation
  33. Hyperparameter Tuning
  34. AutoML
  1. model registry
  2. Apache Airflow
  3. Github Actions
  4. Github
  5. Evidently.ai
  6. Datawarehouse
  7. code repository
  8. Databases
  9. Kafka Brokers
  10. Data Preprocessing pipeline models
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)