Workflow Element Store

  1. Data Generation
  2. APIs and Data Feeds
  3. Data Collaboration and Partnerships
  4. Data Pre-existing
  5. Data Logging
  6. Mobile Applications or IoT Applications
  7. Crowdsourcing
  8. Unstructured data (Images / Videos)
  9. Surveys and Questionnaires
  10. Unstructured data (Audio)
  11. Public Datasets
  12. Structured Data (Tabular)
  13. WebScraping
  1. Informatica
  2. MS SQL server
  3. Oracle DB
  4. Azure blob storage
  5. NoSQL DB
  6. RDBMS
  7. AWS Redshift
  8. PostgreSQL
  9. GCP BigQuery
  10. Azure Data Warehouse
  11. MySQL
  12. S3
  13. GCS
  1. Interaction Features
  2. Handling Noisy Data
  3. Dimensionality Reduction
  4. Feature Extraction from Images
  5. Time-Based Features
  6. Textual Feature Extraction
  7. Polynomial Features
  8. Handling Categorical Data
  9. Data Scaling and Normalization
  10. AutoEDA libraries
  11. Handling Time-Series Data
  12. Dimensionality Reduction
  13. Logarithmic Transform
  14. Dealing with Outliers
  15. Handling Imbalanced Classes
  16. Feature Selection
  17. Binning
  18. Data Scaling and Normalization
  19. Auto-Preprocessing libraries
  20. Domain-Specific Feature Engineering
  21. Encoding Categorical Variables
  22. Handling Missing Data
  1. Supervised Learning-Regression
  2. Blackbox Techniques
  3. Supervised Learning-multiclass classification
  4. Forecasting
  5. Train-Test Split
  6. Ensemble Techniques
  7. Supervised Learning-binary classification
  8. Time Series Anaysis
  9. Data Partitioning
  10. Unsupervised Learning
  1. Data Augmentation
  2. Cross-Validation
  3. Batch Size Selection
  4. Regular Monitoring and Logging
  5. Data Partition-sequential
  6. Gradient Clipping
  7. Train-Test Split
  8. Hyperparameter Tuning
  9. Early Stopping
  10. Batch Normalization
  11. Transfer Learning
  12. Ensemble Methods
  13. Learning Rate Scheduling
  14. Regularization
  15. Weight Initialization
  1. Evaluation Metrics
  2. Data Partitioning
  3. Cross-Validation
  4. Hyperparameter Tuning
  5. Performance Visualization
  6. Train-Test Split
  7. External Validation
  8. Regularization Techniques
  9. Model Comparison
  10. Model Interpretability
  1. Bias and Fairness Assessment
  2. Model Drift
  3. Monitoring and Logging
  4. A/B Testing
  5. Error Analysis
  6. Alerting and Notification
  7. Documentation and API Documentation
  8. Model Versioning
  9. Model Registry
  10. Feedback Collection
  11. Model Retraining and Updating
  12. Streamlit
  13. Web APIs - Flask, FastAPI, etc.
  14. Security Considerations
  15. Continuous Integration and Deployment (CI/CD)
  16. Edge Deployment
  17. Model Health Monitoring
  18. Data Drift Monitoring
  19. Serverless Computing
  20. Performance Metrics
  21. Cloud Deployment
  22. Documentation and Reporting
  23. Concept Drift Detection
  24. Model Serialization
  25. Model Monitoring and Maintenance
  26. Prediction Logging
  27. Containerization
  1. Mobile
  2. End User Machine
ML Workflow Beginner - Architecture
  • Element belongs to model
  • Element not belongs to model
Feature Store

Feature Store
(Online / Offline)

Data Sources

Data Sources

Data Warehouse

Data Warehouse/ Data Lake

Data Pre Processing & Feature Engineering

EDA, Data Pre Processing & Feature Engineering

Model Selection

Model Selection

Model Training & Hyper Parameter Tuning

Model Training & Hyper Parameter Tuning

Model Evaluation

Model Evaluation

Model Deployment

Model Deployment

End User Device

End User Device

Model Registry

Model Registry