Workflow Element Store

  1. Handling Noisy Data
  2. Data Transformations
  3. Auto-Preprocessing libraries
  4. Handling Time-Series Data
  5. Feature Selection
  6. Handling Missing Data
  7. Feature Extraction from Images
  8. Data Scaling and Normalization
  9. Augmentation
  10. Handling Imbalanced Classes
  11. Polynomial Features
  12. Textual Feature Extraction
  13. Interaction Features
  14. Handling Categorical Data
  15. Time-Based Features
  16. Domain-Specific Feature Engineering
  17. Dimensionality Reduction
  18. Dealing with Outliers
  19. Annotation
  20. Binning / Discretization
  21. AutoEDA libraries
  22. Data Partitioning - Train, Validation, & Test
  1. AutoML
  2. Recommendation Engine
  3. Performance Visualization
  4. Batch Size Selection
  5. Binary Classification Techniques
  6. Data Augmentation
  7. Reinforcement Learning
  8. Early Stopping
  9. Cross-Validation
  10. Weight Initialization
  11. Regular Monitoring and Logging
  12. Model Comparison
  13. Ensemble Techniques
  14. Transfer Learning
  15. Transfer Learning
  16. External Validation
  17. Multiclass Classification Techniques
  18. Learning Rate Scheduling
  19. Regularization Techniques
  20. Regularization
  21. Blackbox - Neural Network Models
  22. Regression Analysis
  23. Hyperparameter Tuning
  24. Batch Normalization
  25. Clustering
  26. Network Analytics/ GeoSpatial Analytics
  27. Forecasting Techniques
  28. Natural Language Processing
  29. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  30. Association Rules
  31. Word Embeddings
  32. Model Interpretability
  33. Evaluation Metrics
  34. Cross-Validation
  1. Github
  2. Kafka Brokers
  3. Evidently.ai
  4. Datawarehouse
  5. Data Preprocessing pipeline models
  6. Github Actions
  7. code repository
  8. Databases
  9. Apache Airflow
  10. model registry
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model

Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Orchestration Component

Artifact Store

CI/CD Component

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)