Workflow Element Store

  1. Handling Missing Data
  2. Feature Selection
  3. Handling Time-Series Data
  4. Data Partitioning - Train, Validation, & Test
  5. Dealing with Outliers
  6. Textual Feature Extraction
  7. Binning / Discretization
  8. Time-Based Features
  9. Annotation
  10. Dimensionality Reduction
  11. Polynomial Features
  12. Handling Imbalanced Classes
  13. Data Scaling and Normalization
  14. Handling Categorical Data
  15. Augmentation
  16. Data Transformations
  17. Domain-Specific Feature Engineering
  18. Handling Noisy Data
  19. AutoEDA libraries
  20. Interaction Features
  21. Auto-Preprocessing libraries
  22. Feature Extraction from Images
  1. Performance Visualization
  2. Clustering
  3. Regression Analysis
  4. Reinforcement Learning
  5. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  6. Data Augmentation
  7. Weight Initialization
  8. Hyperparameter Tuning
  9. Batch Size Selection
  10. Transfer Learning
  11. Association Rules
  12. AutoML
  13. Model Comparison
  14. Evaluation Metrics
  15. Natural Language Processing
  16. Multiclass Classification Techniques
  17. Regularization
  18. Recommendation Engine
  19. Model Interpretability
  20. Batch Normalization
  21. Word Embeddings
  22. Cross-Validation
  23. Forecasting Techniques
  24. Regular Monitoring and Logging
  25. Ensemble Techniques
  26. Early Stopping
  27. Regularization Techniques
  28. Network Analytics/ GeoSpatial Analytics
  29. External Validation
  30. Learning Rate Scheduling
  31. Binary Classification Techniques
  32. Cross-Validation
  33. Transfer Learning
  34. Blackbox - Neural Network Models
  1. Evidently.ai
  2. Kafka Brokers
  3. Github Actions
  4. code repository
  5. Datawarehouse
  6. Apache Airflow
  7. Github
  8. Data Preprocessing pipeline models
  9. model registry
  10. Databases
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)