Workflow Element Store

  1. Data Partitioning - Train, Validation, & Test
  2. Textual Feature Extraction
  3. Dealing with Outliers
  4. AutoEDA libraries
  5. Annotation
  6. Feature Selection
  7. Handling Categorical Data
  8. Augmentation
  9. Data Transformations
  10. Polynomial Features
  11. Time-Based Features
  12. Handling Imbalanced Classes
  13. Data Scaling and Normalization
  14. Feature Extraction from Images
  15. Handling Missing Data
  16. Binning / Discretization
  17. Handling Noisy Data
  18. Dimensionality Reduction
  19. Handling Time-Series Data
  20. Auto-Preprocessing libraries
  21. Interaction Features
  22. Domain-Specific Feature Engineering
  1. Regularization Techniques
  2. Clustering
  3. Ensemble Techniques
  4. Binary Classification Techniques
  5. Performance Visualization
  6. Model Interpretability
  7. Multiclass Classification Techniques
  8. Word Embeddings
  9. Weight Initialization
  10. Reinforcement Learning
  11. Regular Monitoring and Logging
  12. Evaluation Metrics
  13. Cross-Validation
  14. Regularization
  15. Batch Normalization
  16. Association Rules
  17. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  18. Regression Analysis
  19. Cross-Validation
  20. Hyperparameter Tuning
  21. Blackbox - Neural Network Models
  22. Data Augmentation
  23. Transfer Learning
  24. Transfer Learning
  25. Early Stopping
  26. Recommendation Engine
  27. Natural Language Processing
  28. AutoML
  29. Batch Size Selection
  30. Learning Rate Scheduling
  31. Forecasting Techniques
  32. Model Comparison
  33. External Validation
  34. Network Analytics/ GeoSpatial Analytics
  1. Apache Airflow
  2. Kafka Brokers
  3. model registry
  4. Datawarehouse
  5. Evidently.ai
  6. code repository
  7. Data Preprocessing pipeline models
  8. Github
  9. Databases
  10. Github Actions
ML Workflow - Architecture
  • Element belongs to model
  • Element not belongs to model

Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Orchestration Component

Artifact Store

CI/CD Component

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)