Workflow Element Store

  1. Annotation
  2. Handling Missing Data
  3. Handling Time-Series Data
  4. Feature Selection
  5. Interaction Features
  6. AutoEDA libraries
  7. Handling Categorical Data
  8. Handling Imbalanced Classes
  9. Feature Extraction from Images
  10. Polynomial Features
  11. Auto-Preprocessing libraries
  12. Augmentation
  13. Dealing with Outliers
  14. Domain-Specific Feature Engineering
  15. Data Scaling and Normalization
  16. Textual Feature Extraction
  17. Dimensionality Reduction
  18. Binning / Discretization
  19. Handling Noisy Data
  20. Data Transformations
  21. Time-Based Features
  22. Data Partitioning - Train, Validation, & Test
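
Item 22 above (Data Partitioning into train, validation, and test sets) can be illustrated with a minimal sketch. The function name `partition`, the 70/15/15 default fractions, and the fixed seed are assumptions made for illustration; the diagram does not prescribe them.

```python
import random


def partition(rows, val_frac=0.15, test_frac=0.15, seed=0):
    """Shuffle rows and split them into train, validation, and test lists.

    Hypothetical helper: the fractions and seed are illustrative defaults.
    """
    if not 0 <= val_frac + test_frac < 1:
        raise ValueError("val_frac + test_frac must be in [0, 1)")
    shuffled = list(rows)
    # A seeded generator keeps the split reproducible across runs.
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_frac)
    n_val = round(len(shuffled) * val_frac)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


train, val, test = partition(range(100))
print(len(train), len(val), len(test))
```

For time-series data (item 3), a chronological split is usually preferable to shuffling, so that the validation and test sets only contain observations later than the training set.
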
  1. Regression Analysis
  2. Batch Size Selection
  3. Network Analytics / Geospatial Analytics
  4. Data Augmentation
  5. Cross-Validation
  6. Weight Initialization
  7. Reinforcement Learning
  8. Transfer Learning
  9. Transfer Learning
  10. Hyperparameter Tuning
  11. Forecasting Techniques
  12. External Validation
  13. Recommendation Engine
  14. Performance Visualization
  15. Learning Rate Scheduling
  16. Evaluation Metrics
  17. Batch Normalization
  18. Multiclass Classification Techniques
  19. AutoML
  20. Blackbox - Neural Network Models
  21. Model Comparison
  22. Regularization Techniques
  23. Natural Language Processing
  24. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  25. Early Stopping
  26. Ensemble Techniques
  27. Word Embeddings
  28. Clustering
  29. Regularization
  30. Cross-Validation
  31. Binary Classification Techniques
  32. Model Interpretability
  33. Regular Monitoring and Logging
  34. Association Rules
  1. Model Registry
  2. Evidently.ai
  3. Data Warehouse
  4. GitHub
  5. Data Preprocessing pipeline models
  6. Kafka Brokers
  7. Apache Airflow
  8. GitHub Actions
  9. Databases
  10. Code Repository
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD Component

Continuous Integration / Continuous Delivery

Continuous Deployment

Artifact Store

Feature Store System

Offline DB

Online DB
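
The Feature Store System with its offline and online databases can be pictured with a minimal in-memory sketch. The diagram only names the two stores; the roles assumed here (the offline store keeps the full feature history for training, the online store keeps only the latest values for low-latency serving) follow common feature-store usage and are an assumption. `ToyFeatureStore` and its methods are hypothetical names, not the API of any real feature store.

```python
from collections import defaultdict


class ToyFeatureStore:
    """Hypothetical in-memory stand-in for a feature store."""

    def __init__(self):
        # Offline DB (assumed role): every record ever written, per entity.
        self.offline = defaultdict(list)
        # Online DB (assumed role): only the most recent record per entity.
        self.online = {}

    def write(self, entity_id, timestamp, features):
        record = (timestamp, dict(features))
        self.offline[entity_id].append(record)
        latest = self.online.get(entity_id)
        # Only overwrite the online value with a record that is at least as new.
        if latest is None or timestamp >= latest[0]:
            self.online[entity_id] = record

    def get_online(self, entity_id):
        """Latest features for serving."""
        return self.online[entity_id][1]

    def get_history(self, entity_id):
        """All records for training, ordered by timestamp."""
        return sorted(self.offline[entity_id], key=lambda rec: rec[0])


store = ToyFeatureStore()
store.write("user_1", 1, {"clicks": 3})
store.write("user_1", 2, {"clicks": 5})
print(store.get_online("user_1"))
print(len(store.get_history("user_1")))
```
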

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow Orchestration Component

Automation ML Workflow Pipeline

Monitoring Component

Model Serving Component

(Prediction on new batch or streaming data)
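
The two prediction modes named above, batch and streaming, can be sketched as follows. This is a minimal illustration under stated assumptions: `toy_model`, `predict_batch`, and `predict_stream` are hypothetical names, and the "model" is a plain function standing in for whatever the Model Registry would supply.

```python
from typing import Callable, Iterable, Iterator, List

# Assumption: a model is any callable mapping a feature vector to a score.
Model = Callable[[List[float]], float]


def toy_model(features: List[float]) -> float:
    """Placeholder model: returns the mean of the input features."""
    return sum(features) / len(features)


def predict_batch(model: Model, rows: Iterable[List[float]]) -> List[float]:
    """Batch mode: score a complete collection and return all results at once."""
    return [model(row) for row in rows]


def predict_stream(model: Model, events: Iterable[List[float]]) -> Iterator[float]:
    """Streaming mode: score each event as it arrives and yield the result lazily."""
    for event in events:
        yield model(event)


batch_scores = predict_batch(toy_model, [[1.0, 3.0], [2.0, 2.0]])
stream_scores = list(predict_stream(toy_model, iter([[4.0, 6.0]])))
print(batch_scores, stream_scores)
```

In a real deployment the streaming input would typically come from a consumer attached to a message broker such as the Kafka Brokers listed in the element store, rather than from an in-memory iterator.
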