Workflow Element Store

  1. Augmentation
  2. Handling Time-Series Data
  3. Dealing with Outliers
  4. Data Transformations
  5. AutoEDA libraries
  6. Annotation
  7. Dimensionality Reduction
  8. Handling Noisy Data
  9. Feature Extraction from Images
  10. Interaction Features
  11. Binning / Discretization
  12. Handling Categorical Data
  13. Auto-Preprocessing libraries
  14. Handling Missing Data
  15. Data Partitioning - Train, Validation, & Test
  16. Textual Feature Extraction
  17. Feature Selection
  18. Handling Imbalanced Classes
  19. Time-Based Features
  20. Polynomial Features
  21. Domain-Specific Feature Engineering
  22. Data Scaling and Normalization
  1. Early Stopping
  2. Hyperparameter Tuning
  3. Clustering
  4. AutoML
  5. Forecasting Techniques
  6. Word Embeddings
  7. Model Interpretability
  8. Recommendation Engine
  9. Evaluation Metrics
  10. Association Rules
  11. Network Analytics / Geospatial Analytics
  12. Data Augmentation
  13. Transfer Learning
  14. Regularization Techniques
  15. Black-Box Neural Network Models
  16. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  17. Batch Size Selection
  18. Regularization
  19. External Validation
  20. Transfer Learning
  21. Model Comparison
  22. Batch Normalization
  23. Learning Rate Scheduling
  24. Natural Language Processing
  25. Regression Analysis
  26. Cross-Validation
  27. Performance Visualization
  28. Reinforcement Learning
  29. Binary Classification Techniques
  30. Ensemble Techniques
  31. Multiclass Classification Techniques
  32. Weight Initialization
  33. Cross-Validation
  34. Regular Monitoring and Logging
  1. Apache Airflow
  2. GitHub
  3. Evidently.ai
  4. Databases
  5. Data Warehouse
  6. Data Preprocessing Pipeline Models
  7. Code Repository
  8. Kafka Brokers
  9. Model Registry
  10. GitHub Actions
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB / Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)
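
As an illustration only, the hand-off from the Model Registry to the Model serving component can be sketched in a few lines of Python. Everything named here (ModelRegistry, predict_batch, predict_stream, the "demo" model) is hypothetical and not taken from the diagram; the registry is an in-memory stand-in and the "model" is a toy function that averages its inputs.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, Iterator, List

# A "model" here is any callable mapping a feature row to a score (assumption).
Model = Callable[[List[float]], float]


@dataclass
class ModelRegistry:
    """Hypothetical in-memory stand-in for the Model Registry box."""
    _versions: Dict[str, List[Model]] = field(default_factory=dict)

    def register(self, name: str, model: Model) -> int:
        """Store a model under a name and return its 1-based version number."""
        self._versions.setdefault(name, []).append(model)
        return len(self._versions[name])

    def latest(self, name: str) -> Model:
        """Return the most recently registered version of a model."""
        return self._versions[name][-1]


def predict_batch(model: Model, rows: Iterable[List[float]]) -> List[float]:
    """Batch path: score every row and return all results at once."""
    return [model(row) for row in rows]


def predict_stream(model: Model, events: Iterable[List[float]]) -> Iterator[float]:
    """Streaming path: score events lazily, one at a time, as they arrive."""
    for event in events:
        yield model(event)


registry = ModelRegistry()
# Toy model for illustration: the mean of the feature values.
version = registry.register("demo", lambda row: sum(row) / len(row))
model = registry.latest("demo")

batch_scores = predict_batch(model, [[1.0, 3.0], [2.0, 4.0]])
stream_scores = list(predict_stream(model, iter([[10.0, 20.0]])))
```

The same model object serves both paths; only the delivery differs. A real deployment would replace the in-memory registry with a persistent one and the iterator with a message consumer such as a Kafka client.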