Workflow Element Store

  1. Handling Imbalanced Classes
  2. Textual Feature Extraction
  3. Binning / Discretization
  4. Dimensionality Reduction
  5. Domain-Specific Feature Engineering
  6. Feature Extraction from Images
  7. Handling Missing Data
  8. Time-Based Features
  9. Handling Time-Series Data
  10. Data Partitioning - Train, Validation, & Test
  11. Data Transformations
  12. Dealing with Outliers
  13. Annotation
  14. Polynomial Features
  15. Handling Categorical Data
  16. Data Scaling and Normalization
  17. Interaction Features
  18. Auto-Preprocessing Libraries
  19. AutoEDA Libraries
  20. Handling Noisy Data
  21. Augmentation
  22. Feature Selection
  1. Binary Classification Techniques
  2. Early Stopping
  3. Model Interpretability
  4. Regular Monitoring and Logging
  5. Word Embeddings
  6. Learning Rate Scheduling
  7. Transfer Learning
  8. Performance Visualization
  9. Hyperparameter Tuning
  10. Model Comparison
  11. Reinforcement Learning
  12. External Validation
  13. Recommendation Engine
  14. Natural Language Processing
  15. Ensemble Techniques
  16. Network Analytics / Geospatial Analytics
  17. Forecasting Techniques
  18. Regularization
  19. Association Rules
  20. Batch Normalization
  21. Batch Size Selection
  22. Evaluation Metrics
  23. Regularization Techniques
  24. Clustering
  25. Black-Box Neural Network Models
  26. Multiclass Classification Techniques
  27. Cross-Validation
  28. Weight Initialization
  29. Transfer Learning
  30. Regression Analysis
  31. AutoML
  32. Data Augmentation
  33. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  34. Cross-Validation
  1. Kafka Brokers
  2. Model Registry
  3. Apache Airflow
  4. Evidently.ai
  5. Data Preprocessing Pipeline Models
  6. Code Repository
  7. Data Warehouse
  8. Databases
  9. GitHub
  10. GitHub Actions
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD Component

Continuous Integration / Continuous Delivery

Continuous Deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow Orchestration Component

Automation ML Workflow Pipeline

Monitoring Component

Model Serving Component

(Prediction on new batch or streaming data)