Workflow Element Store

  1. Textual Feature Extraction
  2. AutoEDA libraries
  3. Interaction Features
  4. Polynomial Features
  5. Domain-Specific Feature Engineering
  6. Annotation
  7. Feature Selection
  8. Data Scaling and Normalization
  9. Auto-Preprocessing libraries
  10. Data Transformations
  11. Handling Missing Data
  12. Binning / Discretization
  13. Data Partitioning - Train, Validation, & Test
  14. Feature Extraction from Images
  15. Time-Based Features
  16. Handling Categorical Data
  17. Handling Time-Series Data
  18. Augmentation
  19. Dimensionality Reduction
  20. Dealing with Outliers
  21. Handling Imbalanced Classes
  22. Handling Noisy Data
  1. Transfer Learning
  2. Blackbox - Neural Network Models
  3. Network Analytics / Geospatial Analytics
  4. Batch Size Selection
  5. Regression Analysis
  6. Transfer Learning
  7. Word Embeddings
  8. GridSearchCV, RandomizedSearchCV, BayesSearchCV
  9. Performance Visualization
  10. Forecasting Techniques
  11. Evaluation Metrics
  12. AutoML
  13. Reinforcement Learning
  14. Hyperparameter Tuning
  15. Regularization
  16. Early Stopping
  17. Clustering
  18. Recommendation Engine
  19. Multiclass Classification Techniques
  20. Learning Rate Scheduling
  21. Binary Classification Techniques
  22. Data Augmentation
  23. Ensemble Techniques
  24. Association Rules
  25. Cross-Validation
  26. Regularization Techniques
  27. Natural Language Processing
  28. Cross-Validation
  29. Model Interpretability
  30. Model Comparison
  31. External Validation
  32. Regular Monitoring and Logging
  33. Batch Normalization
  34. Weight Initialization
  1. Data Preprocessing pipeline models
  2. Evidently.ai
  3. Apache Airflow
  4. Databases
  5. GitHub Actions
  6. GitHub
  7. Code repository
  8. Model registry
  9. Data warehouse
  10. Kafka Brokers
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element does not belong to model
Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD Component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB

Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)