CSTalks - On machine learning - 2 Mar

On Machine Learning

at CSTalks

by Vlad Hosu

Introduction

Fundamental Questions

•What are the fundamental laws that govern all learning processes?

•How can we build computer systems that automatically improve with experience?

Learning: Method

•a process of adaptation

•by which a parametric model is automatically adjusted

•so that some fitness criterion is more readily met

Before Learning

I’m learning, hence I need to adapt!

Result: Liony adjusts his

Biological Learning

•Model: nervous system (neuron connectivity, chemical changes, etc.)

•Fitness: improved behavior (skills, memory, knowledge)

Machine Learning

•a mathematical model

•with adjustable parameters

•optimizing some fitness function

Motivation

•some things are hard to code

•too much data

•automatic learning works better

•is easier to customize/personalize

Learning: Purpose

•estimation

•function - stock market

•class - recognition

•structure - grouping

Requirements

•good learning ability

•scalability to large problems

•simple and easy algorithm implementation

Things Ahead

• Problems

• Clustering

• Classification

• Regression

• Learning issues

• importance of domain knowledge

• learning/generalization ability

• model complexity issues

• Optimization

Important Problems

Clustering

Classification

•Types

•discriminative

•generative

Regression

Making Connections

•discrete value regression => generative classification

•regression on boundary space => discriminative classification

•clustering + labels => classification

Learning Issues

Domain Knowledge

•exploitation of problem structure

•human abstractions are better

•important for picking the right model

Grouping in Images

•groups together similar parts of an image

•select objects

•find patterns

•features = functions of pixel values
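
The grouping idea above can be sketched with k-means on raw RGB pixel values. This is only a minimal sketch, not necessarily the method shown in the talk: the six pixel values, the choice of two clusters, the starting centers, and the names `dist2` and `kmeans` are all assumptions made for illustration.

```python
def dist2(p, q):
    """Squared Euclidean distance between two colour tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, centers, iters=10):
    """Plain k-means: alternate between assigning points and moving centers."""
    centers = list(centers)
    labels = []
    for _ in range(iters):
        # Assignment step: each pixel joins its nearest center.
        labels = [
            min(range(len(centers)), key=lambda k: dist2(p, centers[k]))
            for p in points
        ]
        # Update step: each center moves to the mean of its members.
        for k in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old center if a cluster is empty
                centers[k] = tuple(sum(ch) / len(members) for ch in zip(*members))
    return labels, centers


# Hypothetical image: three reddish and three bluish pixels as (R, G, B).
pixels = [(250, 10, 10), (240, 20, 30), (230, 5, 15),
          (10, 20, 240), (20, 10, 250), (5, 30, 235)]

# Assumed initialisation: start from the first and last pixel.
labels, centers = kmeans(pixels, [pixels[0], pixels[-1]])
print(labels)
```

Pixels that receive the same label form one group. With real images the features would usually be some function of the pixel values (for example a different color space) rather than the raw values used here.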

Segmentation

Color Space

Color Space (cont)

Suitable Clustering

Generalization Ability

•training data generalizes to new data

•important for classification accuracy

Support Vector Machines (SVM)

•linear classifier on distorted space

Learning Ability

over-fitting

Problems with Over-fitting

SVM vs Decision Trees

Complexity Issues

•models should be

•as simple as possible

•but representative of the training data
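
The trade-off in the list above can be illustrated with a small experiment. The sketch below swaps in k-nearest-neighbor regression, which the slides do not mention, simply because it is short: k = 1 memorizes the training data (the most complex model), while k = 3 averages neighbors (a simpler, smoother model). The data, the noise pattern, and the function names are assumptions.

```python
def knn_predict(train, x, k):
    """Average the targets of the k training points nearest to x."""
    nearest = sorted(train, key=lambda pair: abs(pair[0] - x))[:k]
    return sum(y for _, y in nearest) / k


def mse(train, data, k):
    """Mean squared error of a k-NN model on a list of (x, y) pairs."""
    return sum((knn_predict(train, x, k) - y) ** 2 for x, y in data) / len(data)


# Hypothetical data: the true process is y = x, observed with alternating noise.
train = [(float(i), i + (0.5 if i % 2 == 0 else -0.5)) for i in range(10)]
# Noise-free held-out points placed between the training inputs.
test = [(i + 0.25, i + 0.25) for i in range(1, 9)]

for k in (1, 3):
    print("k =", k, "train MSE:", mse(train, train, k), "test MSE:", mse(train, test, k))
```

On this toy data the k = 1 model has zero training error but a larger error on the held-out points than the k = 3 model, which is the usual signature of over-fitting.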

Neural Networks

•model: weights

•fitness: output error

•general function ∑
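
As a rough illustration of "model: weights, fitness: output error", the sketch below trains a single sigmoid neuron by gradient descent on the squared output error. The task (logical AND), the learning rate, the number of passes, and all names are assumptions chosen for a small example; a real network stacks many such units.

```python
import math


def neuron(weights, bias, inputs):
    """Weighted sum of the inputs passed through a sigmoid."""
    total = bias + sum(w * x for w, x in zip(weights, inputs))
    return 1.0 / (1.0 + math.exp(-total))


# Hypothetical task: learn logical AND from its truth table.
samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias, rate = [0.0, 0.0], 0.0, 0.5

for _ in range(5000):
    for inputs, target in samples:
        out = neuron(weights, bias, inputs)
        # Gradient of 0.5 * (out - target)**2 with respect to the weighted sum.
        delta = (out - target) * out * (1.0 - out)
        # Adjust each parameter against its share of the error.
        weights = [w - rate * delta * x for w, x in zip(weights, inputs)]
        bias -= rate * delta

predictions = [1 if neuron(weights, bias, x) > 0.5 else 0 for x, _ in samples]
print(predictions)
```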

Training a Network

Non-trivial Functions

Optimization

Optimizing Fitness

•find extrema

•strategies

•gradient descent

•convex optimization
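
A minimal sketch of gradient descent, assuming a one-dimensional convex fitness function f(x) = (x - 3)^2 picked purely for illustration; the step size, step count, and function name are likewise assumptions.

```python
def gradient_descent(grad, x0, rate=0.1, steps=200):
    """Repeatedly step against the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x -= rate * grad(x)
    return x


# Hypothetical convex fitness: f(x) = (x - 3)**2, with gradient 2 * (x - 3).
x_min = gradient_descent(lambda x: 2.0 * (x - 3.0), x0=0.0)
print(x_min)
```

Because this function is convex, every starting point leads to the same minimum at x = 3.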

Optimization

•finding extrema

•local/global

Gradient Descent

Problem: Local Extrema
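
The local-extrema problem can be reproduced on a small non-convex function. In the sketch below, the function f(x) = x^4 - 3x^2 + x, the two starting points, and the step size are assumptions; the point is only that gradient descent ends in whichever minimum is downhill from where it starts.

```python
def f(x):
    """A hypothetical fitness function with two separate minima."""
    return x ** 4 - 3 * x ** 2 + x


def df(x):
    """Derivative of f."""
    return 4 * x ** 3 - 6 * x + 1


def descend(x, rate=0.01, steps=2000):
    """Plain gradient descent on f, starting from x."""
    for _ in range(steps):
        x -= rate * df(x)
    return x


left = descend(-2.0)   # settles in the minimum on the negative side
right = descend(2.0)   # settles in the minimum on the positive side
print(left, f(left))
print(right, f(right))
```

Both runs stop where the derivative vanishes, but only the run started at -2 reaches the lower of the two minima.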

Problem: Speed

Linear Programming

lines define a convex function

planes in 3D etc

Considerations

•scaling to large feature spaces

•feature selection

•dimensionality reduction

Open Problems

•unlabeled data for regression

•exploiting sparsity in high dimensional spaces for non-parametric learning

•transferring learnt information from one task to simplify learning another

Open Problems (cont)

•algorithms for learning control strategies from delayed rewards and other inputs

•best “active learning” strategies for different learning problems

•degree to which one can preserve data privacy while obtaining the benefits of data mining

The end

Questions?

Types of Regression

•parametric

•non-parametric

Linear vs Non-linear

• linear

• smooth

• under-fitting

• good enough for some processes (e.g. business)

• non-linear

• complex

• over-fitting

• works on most data-sets

Naive Bayes

[Table: example words ("write", "people") with a count per class: No. Good, No. Spam]
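
A minimal sketch of a Naive Bayes spam filter built from per-class word counts. The four training messages, the add-one smoothing, and the function names are assumptions made for illustration; only the words "write" and "people" and the good/spam split come from the slide.

```python
import math
from collections import Counter

# Hypothetical training messages; real filters learn from far larger corpora.
good = ["please write soon", "people write letters"]
spam = ["win money now", "people win money"]

# Number of times each word occurs in good and in spam messages.
good_counts = Counter(word for msg in good for word in msg.split())
spam_counts = Counter(word for msg in spam for word in msg.split())
vocab = set(good_counts) | set(spam_counts)


def log_score(message, counts, prior):
    """log P(class) plus the sum of log P(word | class), with add-one smoothing."""
    total = sum(counts.values())
    score = math.log(prior)
    for word in message.split():
        score += math.log((counts[word] + 1) / (total + len(vocab)))
    return score


def classify(message):
    """Pick the class with the higher Naive Bayes score."""
    prior_good = len(good) / (len(good) + len(spam))
    good_score = log_score(message, good_counts, prior_good)
    spam_score = log_score(message, spam_counts, 1.0 - prior_good)
    return "good" if good_score > spam_score else "spam"


print(classify("write people"))
print(classify("win money"))
```

The "naive" part is the assumption that words occur independently given the class, which is why the per-word log probabilities are simply added.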

Graph Clustering

Mean Shift

Problems in CV

•What are the physical and geometric processes that govern (digital) imaging?

•What are the “informative” areas of an image and how do we detect them?

•What portions of an image pertain to one another and to relevant physical phenomena?

•From one (or more) images, how can we determine the geometry of the scene?

Linear Regression

•model: straight line

•2 adjustable parameters

•fitness function: root mean squared error
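
The three bullets above can be made concrete with a closed-form least-squares fit. Minimizing the root mean squared error gives the same line as minimizing the sum of squared errors, which is what the formulas below do. The five data points and the function names are assumptions.

```python
import math


def fit_line(xs, ys):
    """Least-squares slope and intercept: the model's 2 adjustable parameters."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def rmse(xs, ys, slope, intercept):
    """Root mean squared error of the line on the data (the fitness function)."""
    squared = sum((slope * x + intercept - y) ** 2 for x, y in zip(xs, ys))
    return math.sqrt(squared / len(xs))


# Hypothetical noisy samples of a roughly linear process.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.0]

slope, intercept = fit_line(xs, ys)
print(slope, intercept, rmse(xs, ys, slope, intercept))
```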

Solution Stability

y-shift

Some Issues with Model Selection

normal

outliers

wrong model

Real Photo in Color Space

EM, K-Means

Conjugate Gradient

Newton’s Method
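
A minimal sketch of Newton's method used for optimization, assuming a one-dimensional convex function f(x) = x^2 + e^x chosen for illustration. Each step divides the first derivative by the second, so the method uses curvature to pick its step size instead of a fixed learning rate.

```python
import math


def newton_minimize(d1, d2, x, steps=20):
    """Newton's method for a stationary point: x <- x - f'(x) / f''(x)."""
    for _ in range(steps):
        x -= d1(x) / d2(x)
    return x


def d1(x):
    """First derivative of the hypothetical fitness f(x) = x**2 + exp(x)."""
    return 2.0 * x + math.exp(x)


def d2(x):
    """Second derivative of f; always positive, so f is convex."""
    return 2.0 + math.exp(x)


x_star = newton_minimize(d1, d2, 0.0)
print(x_star, d1(x_star))
```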
