Workflow Element Store

  1. Data Generation
  2. Mobile Applications or IoT Applications
  3. Data Collaboration and Partnerships
  4. Unstructured data (Images / Videos)
  5. APIs and Data Feeds
  6. Public Datasets
  7. Data Pre-existing
  8. Unstructured data (Audio)
  9. Surveys and Questionnaires
  10. WebScraping
  11. Crowdsourcing
  12. Data Logging
  13. Structured Data (Tabular)
  1. RDBMS
  2. S3
  3. GCS
  4. Oracle DB
  5. AWS Redshift
  6. MySQL
  7. PostgreSQL
  8. Informatica
  9. Azure blob storage
  10. MS SQL server
  11. Azure Data Warehouse
  12. NoSQL DB
  13. GCP BigQuery
  1. AutoEDA libraries
  2. Handling Time-Series Data
  3. Domain-Specific Feature Engineering
  4. Handling Missing Data
  5. Encoding Categorical Variables
  6. Binning
  7. Handling Noisy Data
  8. Logarithmic Transform
  9. Textual Feature Extraction
  10. Dealing with Outliers
  11. Polynomial Features
  12. Handling Imbalanced Classes
  13. Data Scaling and Normalization
  14. Feature Selection
  15. Dimensionality Reduction
  16. Interaction Features
  17. Time-Based Features
  18. Handling Categorical Data
  19. Feature Extraction from Images
  20. Dimensionality Reduction
  21. Auto-Preprocessing libraries
  22. Data Scaling and Normalization
  1. Supervised Learning-binary classification
  2. Unsupervised Learning
  3. Blackbox Techniques
  4. Data Partitioning
  5. Ensemble Techniques
  6. Time Series Anaysis
  7. Forecasting
  8. Train-Test Split
  9. Supervised Learning-multiclass classification
  10. Supervised Learning-Regression
  1. Regularization
  2. Cross-Validation
  3. Train-Test Split
  4. Batch Size Selection
  5. Batch Normalization
  6. Ensemble Methods
  7. Learning Rate Scheduling
  8. Transfer Learning
  9. Weight Initialization
  10. Regular Monitoring and Logging
  11. Hyperparameter Tuning
  12. Gradient Clipping
  13. Data Augmentation
  14. Data Partition-sequential
  15. Early Stopping
  1. Model Interpretability
  2. External Validation
  3. Data Partitioning
  4. Evaluation Metrics
  5. Performance Visualization
  6. Regularization Techniques
  7. Train-Test Split
  8. Model Comparison
  9. Cross-Validation
  10. Hyperparameter Tuning
  1. Model Retraining and Updating
  2. Cloud Deployment
  3. Bias and Fairness Assessment
  4. Error Analysis
  5. Model Drift
  6. Data Drift Monitoring
  7. Containerization
  8. Security Considerations
  9. Prediction Logging
  10. Alerting and Notification
  11. A/B Testing
  12. Feedback Collection
  13. Documentation and Reporting
  14. Serverless Computing
  15. Concept Drift Detection
  16. Performance Metrics
  17. Model Registry
  18. Edge Deployment
  19. Web APIs - Flask, FastAPI, etc.
  20. Continuous Integration and Deployment (CI/CD)
  21. Monitoring and Logging
  22. Model Monitoring and Maintenance
  23. Model Versioning
  24. Model Serialization
  25. Model Health Monitoring
  26. Documentation and API Documentation
  27. Streamlit
  1. End User Machine
  2. Mobile
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry