Workflow Element Store

  1. Time-Based Features
  2. AutoEDA libraries
  3. Feature Selection
  4. Feature Extraction from Images
  5. Handling Missing Data
  6. Interaction Features
  7. Data Partitioning - Train, Validation, & Test
  8. Dimensionality Reduction
  9. Binning / Discretization
  10. Textual Feature Extraction
  11. Data Transformations
  12. Dealing with Outliers
  13. Data Scaling and Normalization
  14. Domain-Specific Feature Engineering
  15. Annotation
  16. Auto-Preprocessing libraries
  17. Handling Noisy Data
  18. Handling Imbalanced Classes
  19. Handling Time-Series Data
  20. Polynomial Features
  21. Handling Categorical Data
  22. Augmentation
  1. Forecasting Techniques
  2. Hyperparameter Tuning
  3. Model Comparison
  4. Association Rules
  5. Word Embeddings
  6. Ensemble Techniques
  7. Performance Visualization
  8. Regular Monitoring and Logging
  9. Learning Rate Scheduling
  10. Multiclass Classification Techniques
  11. AutoML
  12. Batch Size Selection
  13. Cross-Validation
  14. Clustering
  15. Recommendation Engine
  16. Regularization Techniques
  17. External Validation
  18. Data Augmentation
  19. Model Interpretability
  20. Early Stopping
  21. Evaluation Metrics
  22. Batch Normalization
  23. Blackbox - Neural Network Models
  24. Cross-Validation
  25. Regularization
  26. Binary Classification Techniques
  27. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  28. Weight Initialization
  29. Network Analytics/ GeoSpatial Analytics
  30. Reinforcement Learning
  31. Regression Analysis
  32. Transfer Learning
  33. Transfer Learning
  34. Natural Language Processing
  1. Datawarehouse
  2. Kafka Brokers
  3. code repository
  4. Evidently.ai
  5. model registry
  6. Data Preprocessing pipeline models
  7. Github Actions
  8. Databases
  9. Apache Airflow
  10. Github
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)