Workflow Element Store

  1. Data Logging
  2. Data Generation
  3. Public Datasets
  4. Surveys and Questionnaires
  5. Data Collaboration and Partnerships
  6. WebScraping
  7. Mobile Applications or IoT Applications
  8. Structured Data (Tabular)
  9. Crowdsourcing
  10. Unstructured data (Audio)
  11. Unstructured data (Images / Videos)
  12. Data Pre-existing
  13. APIs and Data Feeds
  1. MySQL
  2. AWS Redshift
  3. PostgreSQL
  4. Oracle DB
  5. NoSQL DB
  6. Azure blob storage
  7. GCS
  8. GCP BigQuery
  9. MS SQL server
  10. RDBMS
  11. Informatica
  12. S3
  13. Azure Data Warehouse
  1. AutoEDA libraries
  2. Handling Noisy Data
  3. Dimensionality Reduction
  4. Data Scaling and Normalization
  5. Encoding Categorical Variables
  6. Dimensionality Reduction
  7. Handling Imbalanced Classes
  8. Time-Based Features
  9. Domain-Specific Feature Engineering
  10. Feature Selection
  11. Handling Time-Series Data
  12. Textual Feature Extraction
  13. Dealing with Outliers
  14. Logarithmic Transform
  15. Auto-Preprocessing libraries
  16. Feature Extraction from Images
  17. Polynomial Features
  18. Handling Missing Data
  19. Data Scaling and Normalization
  20. Interaction Features
  21. Binning
  22. Handling Categorical Data
  1. Blackbox Techniques
  2. Ensemble Techniques
  3. Supervised Learning-Regression
  4. Unsupervised Learning
  5. Supervised Learning-multiclass classification
  6. Forecasting
  7. Time Series Anaysis
  8. Train-Test Split
  9. Data Partitioning
  10. Supervised Learning-binary classification
  1. Cross-Validation
  2. Train-Test Split
  3. Gradient Clipping
  4. Data Augmentation
  5. Transfer Learning
  6. Learning Rate Scheduling
  7. Regular Monitoring and Logging
  8. Ensemble Methods
  9. Batch Normalization
  10. Hyperparameter Tuning
  11. Data Partition-sequential
  12. Early Stopping
  13. Batch Size Selection
  14. Weight Initialization
  15. Regularization
  1. Regularization Techniques
  2. Performance Visualization
  3. Hyperparameter Tuning
  4. Cross-Validation
  5. External Validation
  6. Model Interpretability
  7. Evaluation Metrics
  8. Train-Test Split
  9. Model Comparison
  10. Data Partitioning
  1. Edge Deployment
  2. Documentation and API Documentation
  3. Containerization
  4. Data Drift Monitoring
  5. Security Considerations
  6. Error Analysis
  7. Model Serialization
  8. Concept Drift Detection
  9. Performance Metrics
  10. Cloud Deployment
  11. A/B Testing
  12. Serverless Computing
  13. Feedback Collection
  14. Streamlit
  15. Prediction Logging
  16. Model Retraining and Updating
  17. Model Drift
  18. Documentation and Reporting
  19. Bias and Fairness Assessment
  20. Model Health Monitoring
  21. Model Monitoring and Maintenance
  22. Model Registry
  23. Continuous Integration and Deployment (CI/CD)
  24. Model Versioning
  25. Web APIs - Flask, FastAPI, etc.
  26. Alerting and Notification
  27. Monitoring and Logging
  1. End User Machine
  2. Mobile
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry