Workflow Element Store

  1. Feature Extraction from Images
  2. Polynomial Features
  3. Binning / Discretization
  4. Data Scaling and Normalization
  5. Handling Noisy Data
  6. Auto-Preprocessing libraries
  7. Dimensionality Reduction
  8. Handling Categorical Data
  9. Annotation
  10. Handling Imbalanced Classes
  11. Textual Feature Extraction
  12. Data Partitioning - Train, Validation, & Test
  13. Handling Missing Data
  14. Data Transformations
  15. Interaction Features
  16. Time-Based Features
  17. Dealing with Outliers
  18. Domain-Specific Feature Engineering
  19. AutoEDA libraries
  20. Feature Selection
  21. Augmentation
  22. Handling Time-Series Data
  1. Transfer Learning
  2. Performance Visualization
  3. Natural Language Processing
  4. Regularization Techniques
  5. Cross-Validation
  6. Early Stopping
  7. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  8. Transfer Learning
  9. Blackbox - Neural Network Models
  10. Forecasting Techniques
  11. Clustering
  12. Word Embeddings
  13. Multiclass Classification Techniques
  14. Evaluation Metrics
  15. Hyperparameter Tuning
  16. AutoML
  17. Model Interpretability
  18. Recommendation Engine
  19. Batch Normalization
  20. Reinforcement Learning
  21. Batch Size Selection
  22. Ensemble Techniques
  23. Regular Monitoring and Logging
  24. Cross-Validation
  25. Binary Classification Techniques
  26. Regression Analysis
  27. Association Rules
  28. Model Comparison
  29. External Validation
  30. Weight Initialization
  31. Learning Rate Scheduling
  32. Network Analytics/ GeoSpatial Analytics
  33. Regularization
  34. Data Augmentation
  1. Databases
  2. model registry
  3. Evidently.ai
  4. Data Preprocessing pipeline models
  5. code repository
  6. Github Actions
  7. Github
  8. Datawarehouse
  9. Apache Airflow
  10. Kafka Brokers
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model
Data Sources

Data Sources

Streaming Data

Streaming Data

Batch Data

Batch Data

Cloud Storage

Cloud Storage

Labeled Data

Labeled Data

Feature Engineering Pipeline

Feature Engineering Pipeline

Experimentation

Experimentation

ML Model

ML Model

Repository

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Offline DB Online DB

Orchestration Component

Artifact Store

CI/CD Component

Model Registry

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Automation ML Workflow Pipeline

Monitoring Component

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)