Workflow Element Store

  1. Augmentation
  2. Handling Missing Data
  3. Binning / Discretization
  4. Handling Noisy Data
  5. Handling Imbalanced Classes
  6. AutoEDA libraries
  7. Feature Extraction from Images
  8. Handling Time-Series Data
  9. Data Scaling and Normalization
  10. Dealing with Outliers
  11. Auto-Preprocessing libraries
  12. Feature Selection
  13. Interaction Features
  14. Data Transformations
  15. Textual Feature Extraction
  16. Annotation
  17. Polynomial Features
  18. Domain-Specific Feature Engineering
  19. Data Partitioning - Train, Validation, & Test
  20. Time-Based Features
  21. Handling Categorical Data
  22. Dimensionality Reduction
  1. Reinforcement Learning
  2. Natural Language Processing
  3. External Validation
  4. Weight Initialization
  5. Batch Size Selection
  6. Regression Analysis
  7. Learning Rate Scheduling
  8. Ensemble Techniques
  9. Regular Monitoring and Logging
  10. Transfer Learning
  11. Regularization
  12. Hyperparameter Tuning
  13. Model Comparison
  14. Batch Normalization
  15. AutoML
  16. Forecasting Techniques
  17. Word Embeddings
  18. Performance Visualization
  19. Cross-Validation
  20. Binary Classification Techniques
  21. Multiclass Classification Techniques
  22. Evaluation Metrics
  23. Transfer Learning
  24. Data Augmentation
  25. Model Interpretability
  26. Clustering
  27. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  28. Association Rules
  29. Early Stopping
  30. Recommendation Engine
  31. Regularization Techniques
  32. Network Analytics/ GeoSpatial Analytics
  33. Cross-Validation
  34. Blackbox - Neural Network Models
  1. Evidently.ai
  2. Kafka Brokers
  3. Apache Airflow
  4. Github
  5. Datawarehouse
  6. code repository
  7. Data Preprocessing pipeline models
  8. Databases
  9. model registry
  10. Github Actions
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)