Notes on AWS, Big Data, Machine Learning and Leadership: Amazon Machine Learning

Overview

"Model"

Supervised machine learning
- Training data (label: spam/not-spam) - max 100GB
- Tuned until receives desired accuracy

Data

Training Data

Process

Build model
- Create datasource
- Explore and understand your data
  - ML computes the statistics
- Create a model
  - Select data source
  - Model type
    - BINARY CLASSIFICATION
      - Yes/No
    - REGRESSION
      - Predicts a number
      - e.g. how much will this house sell for
    - MULTI CLASSIFICATION
      - Assign a category (e.g. genre)
  - Each model type has evaluation score
  - Reciple
    - Transformations applied to variables
Evaluate and optimize
- All model types have visualization
- You can tweak parameters
Retrieve predictions
- Batch: large volume prediction analysis
  - Async
  - They are output to S3
- Real-Time:
  - Sync
  - Low-latency

Use cases

Notes

Two modes
- Interactive (experimentation)
- API (automated access)
  - Batch predictions
  - Real-Time predictions

Notes on AWS, Big Data, Machine Learning and Leadership