Workflow Element Store

  1. APIs and Data Feeds
  2. Data Generation
  3. Public Datasets
  4. Unstructured data (Audio)
  5. Unstructured data (Images / Videos)
  6. Structured Data (Tabular)
  7. Surveys and Questionnaires
  8. Data Collaboration and Partnerships
  9. Mobile Applications or IoT Applications
  10. Crowdsourcing
  11. Data Logging
  12. Data Pre-existing
  13. WebScraping
  1. RDBMS
  2. Azure blob storage
  3. S3
  4. GCS
  5. Oracle DB
  6. GCP BigQuery
  7. MS SQL server
  8. Informatica
  9. PostgreSQL
  10. AWS Redshift
  11. Azure Data Warehouse
  12. NoSQL DB
  13. MySQL
  1. Handling Time-Series Data
  2. Dimensionality Reduction
  3. Handling Imbalanced Classes
  4. Time-Based Features
  5. Feature Extraction from Images
  6. Auto-Preprocessing libraries
  7. Handling Missing Data
  8. Handling Categorical Data
  9. Encoding Categorical Variables
  10. Domain-Specific Feature Engineering
  11. Textual Feature Extraction
  12. Data Scaling and Normalization
  13. Polynomial Features
  14. Dealing with Outliers
  15. AutoEDA libraries
  16. Feature Selection
  17. Handling Noisy Data
  18. Dimensionality Reduction
  19. Interaction Features
  20. Logarithmic Transform
  21. Data Scaling and Normalization
  22. Binning
  1. Train-Test Split
  2. Supervised Learning-multiclass classification
  3. Time Series Anaysis
  4. Supervised Learning-Regression
  5. Ensemble Techniques
  6. Forecasting
  7. Unsupervised Learning
  8. Supervised Learning-binary classification
  9. Data Partitioning
  10. Blackbox Techniques
  1. Data Augmentation
  2. Learning Rate Scheduling
  3. Batch Normalization
  4. Weight Initialization
  5. Regularization
  6. Gradient Clipping
  7. Train-Test Split
  8. Hyperparameter Tuning
  9. Cross-Validation
  10. Batch Size Selection
  11. Ensemble Methods
  12. Data Partition-sequential
  13. Early Stopping
  14. Regular Monitoring and Logging
  15. Transfer Learning
  1. Evaluation Metrics
  2. Cross-Validation
  3. Data Partitioning
  4. Train-Test Split
  5. External Validation
  6. Performance Visualization
  7. Regularization Techniques
  8. Model Interpretability
  9. Model Comparison
  10. Hyperparameter Tuning
  1. Cloud Deployment
  2. Model Versioning
  3. Documentation and API Documentation
  4. Web APIs - Flask, FastAPI, etc.
  5. Streamlit
  6. Serverless Computing
  7. Bias and Fairness Assessment
  8. Model Health Monitoring
  9. Security Considerations
  10. Feedback Collection
  11. Model Registry
  12. Continuous Integration and Deployment (CI/CD)
  13. Edge Deployment
  14. Model Retraining and Updating
  15. Performance Metrics
  16. A/B Testing
  17. Containerization
  18. Model Monitoring and Maintenance
  19. Model Serialization
  20. Concept Drift Detection
  21. Documentation and Reporting
  22. Data Drift Monitoring
  23. Model Drift
  24. Monitoring and Logging
  25. Alerting and Notification
  26. Error Analysis
  27. Prediction Logging
  1. End User Machine
  2. Mobile
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry