Workflow Element Store

  1. Surveys and Questionnaires
  2. Data Collaboration and Partnerships
  3. Data Logging
  4. Unstructured data (Audio)
  5. WebScraping
  6. Crowdsourcing
  7. APIs and Data Feeds
  8. Structured Data (Tabular)
  9. Unstructured data (Images / Videos)
  10. Data Generation
  11. Data Pre-existing
  12. Public Datasets
  13. Mobile Applications or IoT Applications
  1. Azure blob storage
  2. NoSQL DB
  3. GCP BigQuery
  4. MySQL
  5. Informatica
  6. Oracle DB
  7. GCS
  8. S3
  9. AWS Redshift
  10. MS SQL server
  11. PostgreSQL
  12. Azure Data Warehouse
  13. RDBMS
  1. Interaction Features
  2. Data Scaling and Normalization
  3. Polynomial Features
  4. Time-Based Features
  5. Dimensionality Reduction
  6. AutoEDA libraries
  7. Domain-Specific Feature Engineering
  8. Binning
  9. Handling Imbalanced Classes
  10. Logarithmic Transform
  11. Handling Missing Data
  12. Feature Selection
  13. Encoding Categorical Variables
  14. Textual Feature Extraction
  15. Handling Noisy Data
  16. Feature Extraction from Images
  17. Auto-Preprocessing libraries
  18. Handling Categorical Data
  19. Dealing with Outliers
  20. Data Scaling and Normalization
  21. Dimensionality Reduction
  22. Handling Time-Series Data
  1. Unsupervised Learning
  2. Supervised Learning-Regression
  3. Supervised Learning-binary classification
  4. Ensemble Techniques
  5. Data Partitioning
  6. Blackbox Techniques
  7. Train-Test Split
  8. Forecasting
  9. Time Series Anaysis
  10. Supervised Learning-multiclass classification
  1. Regular Monitoring and Logging
  2. Ensemble Methods
  3. Data Partition-sequential
  4. Transfer Learning
  5. Cross-Validation
  6. Early Stopping
  7. Hyperparameter Tuning
  8. Batch Size Selection
  9. Train-Test Split
  10. Batch Normalization
  11. Gradient Clipping
  12. Regularization
  13. Learning Rate Scheduling
  14. Data Augmentation
  15. Weight Initialization
  1. Model Comparison
  2. Regularization Techniques
  3. Cross-Validation
  4. Evaluation Metrics
  5. Model Interpretability
  6. External Validation
  7. Performance Visualization
  8. Train-Test Split
  9. Data Partitioning
  10. Hyperparameter Tuning
  1. Monitoring and Logging
  2. Model Retraining and Updating
  3. Data Drift Monitoring
  4. Edge Deployment
  5. Feedback Collection
  6. Documentation and API Documentation
  7. Cloud Deployment
  8. Performance Metrics
  9. Bias and Fairness Assessment
  10. Alerting and Notification
  11. Continuous Integration and Deployment (CI/CD)
  12. Model Versioning
  13. Prediction Logging
  14. A/B Testing
  15. Error Analysis
  16. Model Health Monitoring
  17. Security Considerations
  18. Serverless Computing
  19. Model Registry
  20. Streamlit
  21. Model Monitoring and Maintenance
  22. Model Serialization
  23. Containerization
  24. Model Drift
  25. Documentation and Reporting
  26. Concept Drift Detection
  27. Web APIs - Flask, FastAPI, etc.
  1. Mobile
  2. End User Machine
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry