Workflow Element Store

  1. Data Pre-existing
  2. Mobile Applications or IoT Applications
  3. Surveys and Questionnaires
  4. Data Generation
  5. Crowdsourcing
  6. Structured Data (Tabular)
  7. Unstructured data (Audio)
  8. APIs and Data Feeds
  9. Unstructured data (Images / Videos)
  10. Data Collaboration and Partnerships
  11. WebScraping
  12. Public Datasets
  13. Data Logging
  1. S3
  2. PostgreSQL
  3. Informatica
  4. RDBMS
  5. Azure Data Warehouse
  6. AWS Redshift
  7. Azure blob storage
  8. MS SQL server
  9. Oracle DB
  10. GCS
  11. GCP BigQuery
  12. NoSQL DB
  13. MySQL
  1. Dealing with Outliers
  2. Textual Feature Extraction
  3. Time-Based Features
  4. Polynomial Features
  5. AutoEDA libraries
  6. Dimensionality Reduction
  7. Logarithmic Transform
  8. Handling Missing Data
  9. Auto-Preprocessing libraries
  10. Data Scaling and Normalization
  11. Feature Selection
  12. Handling Time-Series Data
  13. Interaction Features
  14. Binning
  15. Handling Categorical Data
  16. Encoding Categorical Variables
  17. Domain-Specific Feature Engineering
  18. Handling Imbalanced Classes
  19. Handling Noisy Data
  20. Feature Extraction from Images
  21. Data Scaling and Normalization
  22. Dimensionality Reduction
  1. Unsupervised Learning
  2. Forecasting
  3. Supervised Learning-Regression
  4. Time Series Anaysis
  5. Supervised Learning-multiclass classification
  6. Train-Test Split
  7. Ensemble Techniques
  8. Data Partitioning
  9. Blackbox Techniques
  10. Supervised Learning-binary classification
  1. Regularization
  2. Cross-Validation
  3. Hyperparameter Tuning
  4. Weight Initialization
  5. Train-Test Split
  6. Gradient Clipping
  7. Transfer Learning
  8. Regular Monitoring and Logging
  9. Data Augmentation
  10. Batch Size Selection
  11. Batch Normalization
  12. Data Partition-sequential
  13. Early Stopping
  14. Ensemble Methods
  15. Learning Rate Scheduling
  1. Model Comparison
  2. Model Interpretability
  3. Data Partitioning
  4. Hyperparameter Tuning
  5. External Validation
  6. Performance Visualization
  7. Evaluation Metrics
  8. Train-Test Split
  9. Cross-Validation
  10. Regularization Techniques
  1. Bias and Fairness Assessment
  2. Continuous Integration and Deployment (CI/CD)
  3. Web APIs - Flask, FastAPI, etc.
  4. Performance Metrics
  5. Concept Drift Detection
  6. Feedback Collection
  7. Serverless Computing
  8. Documentation and API Documentation
  9. Security Considerations
  10. Model Versioning
  11. Model Health Monitoring
  12. Model Retraining and Updating
  13. Model Monitoring and Maintenance
  14. Data Drift Monitoring
  15. Error Analysis
  16. Model Registry
  17. A/B Testing
  18. Edge Deployment
  19. Model Serialization
  20. Containerization
  21. Alerting and Notification
  22. Model Drift
  23. Cloud Deployment
  24. Monitoring and Logging
  25. Prediction Logging
  26. Streamlit
  27. Documentation and Reporting
  1. End User Machine
  2. Mobile
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry