Workflow Element Store

  1. Data Collaboration and Partnerships
  2. Data Logging
  3. Data Generation
  4. Unstructured data (Images / Videos)
  5. Public Datasets
  6. Structured Data (Tabular)
  7. Crowdsourcing
  8. WebScraping
  9. APIs and Data Feeds
  10. Data Pre-existing
  11. Surveys and Questionnaires
  12. Unstructured data (Audio)
  13. Mobile Applications or IoT Applications
  1. MS SQL server
  2. PostgreSQL
  3. Oracle DB
  4. Informatica
  5. S3
  6. NoSQL DB
  7. MySQL
  8. GCP BigQuery
  9. Azure blob storage
  10. AWS Redshift
  11. RDBMS
  12. Azure Data Warehouse
  13. GCS
  1. Textual Feature Extraction
  2. Polynomial Features
  3. Binning
  4. Handling Categorical Data
  5. Domain-Specific Feature Engineering
  6. Feature Extraction from Images
  7. Auto-Preprocessing libraries
  8. Logarithmic Transform
  9. Data Scaling and Normalization
  10. Feature Selection
  11. Handling Imbalanced Classes
  12. Time-Based Features
  13. Dimensionality Reduction
  14. Interaction Features
  15. Handling Noisy Data
  16. Dealing with Outliers
  17. AutoEDA libraries
  18. Data Scaling and Normalization
  19. Dimensionality Reduction
  20. Handling Missing Data
  21. Encoding Categorical Variables
  22. Handling Time-Series Data
  1. Unsupervised Learning
  2. Blackbox Techniques
  3. Ensemble Techniques
  4. Time Series Anaysis
  5. Supervised Learning-multiclass classification
  6. Supervised Learning-Regression
  7. Train-Test Split
  8. Forecasting
  9. Supervised Learning-binary classification
  10. Data Partitioning
  1. Gradient Clipping
  2. Regularization
  3. Regular Monitoring and Logging
  4. Transfer Learning
  5. Hyperparameter Tuning
  6. Early Stopping
  7. Weight Initialization
  8. Batch Size Selection
  9. Cross-Validation
  10. Train-Test Split
  11. Batch Normalization
  12. Learning Rate Scheduling
  13. Data Augmentation
  14. Data Partition-sequential
  15. Ensemble Methods
  1. External Validation
  2. Regularization Techniques
  3. Data Partitioning
  4. Cross-Validation
  5. Train-Test Split
  6. Evaluation Metrics
  7. Model Comparison
  8. Model Interpretability
  9. Performance Visualization
  10. Hyperparameter Tuning
  1. Model Registry
  2. Alerting and Notification
  3. Performance Metrics
  4. Feedback Collection
  5. Prediction Logging
  6. Continuous Integration and Deployment (CI/CD)
  7. Edge Deployment
  8. Documentation and API Documentation
  9. Error Analysis
  10. A/B Testing
  11. Bias and Fairness Assessment
  12. Security Considerations
  13. Containerization
  14. Model Retraining and Updating
  15. Cloud Deployment
  16. Model Health Monitoring
  17. Streamlit
  18. Model Monitoring and Maintenance
  19. Monitoring and Logging
  20. Documentation and Reporting
  21. Concept Drift Detection
  22. Model Drift
  23. Model Serialization
  24. Web APIs - Flask, FastAPI, etc.
  25. Model Versioning
  26. Serverless Computing
  27. Data Drift Monitoring
  1. Mobile
  2. End User Machine
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry