Workflow Element Store

  1. Polynomial Features
  2. Binning / Discretization
  3. Annotation
  4. Textual Feature Extraction
  5. Data Scaling and Normalization
  6. Handling Imbalanced Classes
  7. Time-Based Features
  8. Dimensionality Reduction
  9. Handling Time-Series Data
  10. Auto-Preprocessing Libraries
  11. Data Transformations
  12. Data Partitioning - Train, Validation, & Test
  13. Augmentation
  14. Dealing with Outliers
  15. Feature Extraction from Images
  16. Domain-Specific Feature Engineering
  17. Handling Categorical Data
  18. Feature Selection
  19. AutoEDA Libraries
  20. Interaction Features
  21. Handling Noisy Data
  22. Handling Missing Data
  1. Learning Rate Scheduling
  2. Batch Normalization
  3. Black-Box Neural Network Models
  4. Weight Initialization
  5. AutoML
  6. Cross-Validation
  7. Hyperparameter Tuning
  8. Binary Classification Techniques
  9. Recommendation Engine
  10. Performance Visualization
  11. Regularization
  12. External Validation
  13. Multiclass Classification Techniques
  14. Regular Monitoring and Logging
  15. Transfer Learning
  16. Natural Language Processing
  17. Association Rules
  18. Forecasting Techniques
  19. Transfer Learning
  20. Batch Size Selection
  21. Evaluation Metrics
  22. Cross-Validation
  23. Clustering
  24. Data Augmentation
  25. Network Analytics / Geospatial Analytics
  26. Model Comparison
  27. Model Interpretability
  28. Regression Analysis
  29. Early Stopping
  30. Reinforcement Learning
  31. Regularization Techniques
  32. Word Embeddings
  33. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  34. Ensemble Techniques
  1. Model Registry
  2. GitHub
  3. Data Preprocessing Pipeline Models
  4. Databases
  5. GitHub Actions
  6. Code Repository
  7. Evidently.ai
  8. Kafka Brokers
  9. Data Warehouse
  10. Apache Airflow
ML Workflow Advanced - Architecture
  • Element belongs to the model
  • Element does not belong to the model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD Component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)