Workflow Element Store

  1. Handling Missing Data
  2. Interaction Features
  3. Domain-Specific Feature Engineering
  4. Handling Imbalanced Classes
  5. Feature Extraction from Images
  6. Dealing with Outliers
  7. Polynomial Features
  8. Handling Categorical Data
  9. Data Transformations
  10. Handling Time-Series Data
  11. Annotation
  12. Time-Based Features
  13. Textual Feature Extraction
  14. Binning / Discretization
  15. Feature Selection
  16. Data Partitioning - Train, Validation, & Test
  17. Augmentation
  18. Data Scaling and Normalization
  19. Handling Noisy Data
  20. Dimensionality Reduction
  21. AutoEDA libraries
  22. Auto-Preprocessing libraries
  1. Model Comparison
  2. Transfer Learning
  3. Cross-Validation
  4. Transfer Learning
  5. Regression Analysis
  6. Blackbox - Neural Network Models
  7. Weight Initialization
  8. Recommendation Engine
  9. GridSearchCV, RandomisedSearchCV, BayesianSearchCV
  10. Association Rules
  11. Multiclass Classification Techniques
  12. Forecasting Techniques
  13. External Validation
  14. Early Stopping
  15. Network Analytics/ GeoSpatial Analytics
  16. Model Interpretability
  17. Data Augmentation
  18. Hyperparameter Tuning
  19. Regularization Techniques
  20. Binary Classification Techniques
  21. Regularization
  22. Cross-Validation
  23. Batch Normalization
  24. Reinforcement Learning
  25. Evaluation Metrics
  26. Word Embeddings
  27. Natural Language Processing
  28. Ensemble Techniques
  29. Regular Monitoring and Logging
  30. Batch Size Selection
  31. Performance Visualization
  32. Learning Rate Scheduling
  33. Clustering
  34. AutoML
  1. Kafka Brokers
  2. Evidently.ai
  3. model registry
  4. Github Actions
  5. Datawarehouse
  6. Github
  7. Databases
  8. Data Preprocessing pipeline models
  9. code repository
  10. Apache Airflow
ML Workflow Advanced - Architecture
  • Element belongs to model
  • Element not belongs to model

Data Sources

Streaming Data

Batch Data

Cloud Storage

Labeled Data

Feature Engineering Pipeline

Experimentation

ML Model

Repository

CI/CD component

Continuous integration/Continuous delivery

Continuous deployment

Artifact Store

Feature Store System

Orchestration Component

Artifact Store

CI/CD Component

Scheduler

Workflow orchestration component

Automation ML Workflow Pipeline

Monitoring Component

Model serving component

(Prediction on new batch or streaming data)