Workflow Element Store

  1. APIs and Data Feeds
  2. Data Collaboration and Partnerships
  3. WebScraping
  4. Mobile Applications or IoT Applications
  5. Crowdsourcing
  6. Data Generation
  7. Data Pre-existing
  8. Unstructured data (Images / Videos)
  9. Unstructured data (Audio)
  10. Data Logging
  11. Surveys and Questionnaires
  12. Public Datasets
  13. Structured Data (Tabular)
  1. S3
  2. NoSQL DB
  3. Azure Data Warehouse
  4. GCS
  5. PostgreSQL
  6. GCP BigQuery
  7. Oracle DB
  8. MS SQL server
  9. Azure blob storage
  10. AWS Redshift
  11. MySQL
  12. RDBMS
  13. Informatica
  1. Binning
  2. Encoding Categorical Variables
  3. Logarithmic Transform
  4. Interaction Features
  5. Feature Extraction from Images
  6. AutoEDA libraries
  7. Dealing with Outliers
  8. Handling Missing Data
  9. Handling Time-Series Data
  10. Dimensionality Reduction
  11. Textual Feature Extraction
  12. Handling Categorical Data
  13. Handling Imbalanced Classes
  14. Handling Noisy Data
  15. Data Scaling and Normalization
  16. Data Scaling and Normalization
  17. Dimensionality Reduction
  18. Domain-Specific Feature Engineering
  19. Auto-Preprocessing libraries
  20. Time-Based Features
  21. Polynomial Features
  22. Feature Selection
  1. Forecasting
  2. Ensemble Techniques
  3. Supervised Learning-Regression
  4. Blackbox Techniques
  5. Unsupervised Learning
  6. Supervised Learning-multiclass classification
  7. Supervised Learning-binary classification
  8. Data Partitioning
  9. Time Series Anaysis
  10. Train-Test Split
  1. Data Augmentation
  2. Ensemble Methods
  3. Cross-Validation
  4. Transfer Learning
  5. Batch Size Selection
  6. Batch Normalization
  7. Data Partition-sequential
  8. Regular Monitoring and Logging
  9. Regularization
  10. Train-Test Split
  11. Learning Rate Scheduling
  12. Early Stopping
  13. Weight Initialization
  14. Hyperparameter Tuning
  15. Gradient Clipping
  1. Train-Test Split
  2. Performance Visualization
  3. Hyperparameter Tuning
  4. Evaluation Metrics
  5. Regularization Techniques
  6. Model Comparison
  7. External Validation
  8. Data Partitioning
  9. Cross-Validation
  10. Model Interpretability
  1. Alerting and Notification
  2. Concept Drift Detection
  3. Edge Deployment
  4. Security Considerations
  5. Bias and Fairness Assessment
  6. Documentation and API Documentation
  7. Model Registry
  8. Performance Metrics
  9. Containerization
  10. Model Serialization
  11. Documentation and Reporting
  12. Monitoring and Logging
  13. Data Drift Monitoring
  14. Serverless Computing
  15. Continuous Integration and Deployment (CI/CD)
  16. Prediction Logging
  17. Web APIs - Flask, FastAPI, etc.
  18. Model Versioning
  19. Error Analysis
  20. Model Drift
  21. Model Health Monitoring
  22. Cloud Deployment
  23. A/B Testing
  24. Model Monitoring and Maintenance
  25. Model Retraining and Updating
  26. Feedback Collection
  27. Streamlit
  1. Mobile
  2. End User Machine
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry