Workflow Element Store

  1. Time-Based Features
  2. Handling Missing Data
  3. Annotation
  4. Feature Extraction from Images
  5. Handling Categorical Data
  6. Binning / Discretization
  7. Data Scaling and Normalization
  8. Augmentation
  9. Domain-Specific Feature Engineering
  10. Handling Time-Series Data
  11. Handling Imbalanced Classes
  12. Data Transformations
  13. Handling Noisy Data
  14. Interaction Features
  15. Dealing with Outliers
  16. Data Partitioning - Train, Validation, & Test
  17. AutoEDA libraries
  18. Feature Selection
  19. Polynomial Features
  20. Auto-Preprocessing libraries
  21. Textual Feature Extraction
  22. Dimensionality Reduction
  1. Performance Visualization
  2. Binary Classification Techniques
  3. Transfer Learning
  4. Cross-Validation
  5. Hyperparameter Tuning
  6. Recommendation Engine
  7. Batch Normalization
  8. Regression Analysis
  9. Learning Rate Scheduling
  10. Blackbox - Neural Network Models
  11. Cross-Validation
  12. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  13. Network Analytics/ GeoSpatial Analytics
  14. Model Comparison
  15. Evaluation Metrics
  16. Clustering
  17. Natural Language Processing
  18. Association Rules
  19. Regularization Techniques
  20. Weight Initialization
  21. Reinforcement Learning
  22. Multiclass Classification Techniques
  23. Ensemble Techniques
  24. External Validation
  25. Transfer Learning
  26. Early Stopping
  27. Word Embeddings
  28. Batch Size Selection
  29. Model Interpretability
  30. Forecasting Techniques
  31. Data Augmentation
  32. Regularization
  33. AutoML
  34. Regular Monitoring and Logging
  1. Github
  2. Databases
  3. Data Preprocessing pipeline models
  4. code repository
  5. Apache Airflow
  6. Kafka Brokers
  7. Datawarehouse
  8. Github Actions
  9. model registry
  10. Evidently.ai
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)