+ All Categories
Home > Documents > ML for Scent - MIT Deep Learning...

ML for Scent - MIT Deep Learning...

Date post: 16-May-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
46
ML for Scent Alex Wiltschko, Benjamin Sanchez-Lengeling, Brian Lee, Carey Radebaugh, Emily Reif, Jennifer Wei
Transcript
Page 1: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

ML for ScentAlex Wiltschko, Benjamin Sanchez-Lengeling, Brian Lee, Carey Radebaugh, Emily Reif, Jennifer Wei

Page 2: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Hi!

I’m Alex Wiltschko, a scientist at Google Research.

I lead a research group within Google Brain

that focuses on machine learning for olfaction.

Page 3: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Google Research

3500 Researchers & Engineers18 offices, 11 countries

Make machines intelligent. Improve people’s lives.

Page 4: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

● Foundational research

● Building tools to enable research & democratize AI/ML

● AI-enabling Google products

Our Approach

Page 5: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3
Page 6: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Do for olfaction what machine learning has already done for vision and hearing.

To digitize the sense of smell, and make the world’s smells and flavors searchable. Every flower patch, every natural gas leak, every item on every menu in every restaurant.

We’re starting at the very beginning, with the simplest problem…but first, some olfaction facts!

What’s our goal?

Page 7: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Most airflow is not smelled. Passes right on through the lower turbinates to your lungs.

The OSNs are one of two parts of your brain that are exposed to the world (the other is the pituitary gland, and that’s in blood, so only half-counts).

Taste lives on your tongue. Flavor is both taste and retronasal olfaction, from a “chimney effect”.

Page 8: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

GPCR: G-protein coupled receptorOR: GPCR Olfactory ReceptorOSN: Olfactory sensory neuron

~400 ORs expressed in humans (as opposed to 3 types of cones)~1000 in mice. ~2000 in elephants!

One OR per OSN.

ORs comprise 2% of your genome, but many are pseudogenes.

OR structure is unknown, they are uncrystallized. Further, only ~40 expressed in cell lines.

Their ligand responses are broadly tuned, but many ORs (22/400) are still orphans, with no known ligand.

Page 9: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3
Page 10: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3
Page 11: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

People do smell different things!

Mainland et al 2015

SNPs in single ORs result in sensory dimorphisms. The most famous ones are:

● OR7D4 T113M: normally funky beta-androstenone (boar taint) is rendered pleasant.

● OR5A1 N183D: nearly completely Mendelian. Carriers of the mutation can detect beta-ionine at two orders of magnitude lower concentration

● Olfactory sensory dimorphisms are likely common — humans differ functionally at 30% of OR alleles.

● ~4.5% of the world is colorblind (CBA)● 13% in the US has selective hearing loss (NIDCD)● All this to argue — smell is not defacto finicky or illogical.

Right now, we’re starting with the simplest problem

Page 12: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Predict

“Smells sweet, with a hint of vanilla, some notes of creamy and back note of chocolate.”

Odor descriptors

Page 13: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

And why is this hard?

Page 14: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

We built a benchmark from perfumery raw materials

Page 15: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Vanillin1: sweet, vanilla, creamy, chocolate2: sweet, vanilla, creamy, phenolic

General agreement between repeated ratings. All ratings by perfume experts.

We built a benchmark from perfumery raw materials

Page 16: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

...

solvent

orangeflower

bready

black currant

radish

fruitygreen

sweetfloralwoody

We built a benchmark from perfumery raw materials

Page 17: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

We built a benchmark from perfumery raw materialsodors

odor

s

Page 18: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Historical SOR approachesPen & Paper

Rule-based principles for predicting odor. There are as many exceptions as there are rules.

Kraft’s vetiver rule

(-)-khusimone 1,7-cyclogermacra-1(10),4-dien-15-al

4,7,7-Trimethyl-1-methylidenespiro[4.5]decan-2-one

Fig 3.22 Scent and Chemistry (Ohloff, Pickenhagen, Kraft)

Ohloff’s ruleBajgrowicz and Broger’s ambergris

osmophore modelBuchbauer’s santalolsBoelens’ synthetic muguet

Page 19: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Traditional Computational Approaches

Predict ● Toxicity● Solubility● Photovoltaic

efficiency (solar cell)● Chemical potential

(batteries)● ...

“bag of sub-graphs” representation AKA molecular fingerprints

Page 20: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Labeled Photos

“cat”

“dog”

“car”

“apple”

“flower”

Page 21: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Unlabeled Photo

Page 22: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

“Hello, how are you?”

PIXELS

AUDIO

TEXT

PIXELS

“lion”

“How cold is it outside?”

“你好,你好吗?”

“A blue and yellow train travelling down the tracks”

Input Output

Page 23: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3
Page 24: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Graphs as input to neural networks: not just images, sounds or words

Page 25: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Inside a GNNConverting a molecule to a graph

Page 26: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Inside a GNNPropagating information & transforming a graph

Page 27: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

A GNN to predict odor descriptors

Page 28: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

And how well can we predict?

Page 29: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

A representation optimized for odor

Last layer embeddings63 dimension vector

Page 30: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Exploring the geometric space of odor

Page 31: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Exploring the geometric space of odor

Page 32: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

What do nearby molecules look like?Inspired by word embeddings. Are there “molecular synonyms”?

First, what do “nearest neighbors” look like if you use just structure, and ignore our neural network?

Then, what do nearest neighbors look like to our GCN?

Page 33: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

herbal, nutty, coconut, coumarinic, cinnamon, sweet, hay, tobacco

dihydrocoumarin

Molecular neighbors: using structure

Acetyl thymol

Tolyl decanoate

berry, medicinal,

fruity, phenolic

medicinal, sweet, fruity, floral

smoky, spicy, balsamic

sweet, phenolic, floral

spicy

ortho-cresyl isobutyrate ortho-cresyl acetate

ethyl 3-(2-hydroxyphenyl) propionate

Page 34: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

2-benzofuran carboxaldehyde

coumarin

green, coumarinic

phenolic, hay, lactonic, coconut, coumarinic, almond, sweet, powdery

sweet, nutty, almond

sweet, coumarinic, hay

green, vanilla, nutty,

coumarinic, spicy

1,4-benzodioxin-2(3H)-one

coumane

phthalide

Molecular neighbors: using GCN features

herbal, nutty, coconut, coumarinic, cinnamon, sweet, hay, tobacco

dihydrocoumarin

Page 35: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

You might hear ‘fine-tuning’ referred to as a strategy for ‘transfer learning’.

Transfer learning in chemistry, today, rarely works. Do our embeddings transfer learn to other tasks?

Do these representations generalize?Using a learned model to make predictions on a new task is ‘transfer learning’

Page 36: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Do these representations generalize?

Page 37: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

DREAM Olfactory Challenge Dravnieks

Transfer-learned to achieve state-of-the-art on the two major olfactory benchmark tasks

Page 38: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

But why is the neural network making these predictions?Toy test example: classify whether a molecule has benzene. Which atoms contribute to predictions?

Benzene?

This is just one task of potentially hundreds, of varying complexity.

Page 39: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

But why is the neural network making these predictions?Toy test example: classify whether a molecule has benzene. Which atoms contribute to predictions?

Page 40: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

But why is the neural network making these predictions?Toy test example: classify whether a molecule has benzene. Which atoms contribute to predictions?

Positive examples

Negative examples

Page 41: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

But why is the neural network making these predictions? Odor percept — “garlic”

Positive examples

Negative examples

Page 42: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

But why is the neural network making these predictions? Odor percept — “fatty”

Positive examples

Negative examples

Page 43: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

But why is the neural network making these predictions? Odor percept — “vanilla”

Positive examples

Negative examples

Page 44: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

But why is the neural network making these predictions? Odor percept — “winey”

Positive examples

Negative examples

Page 45: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

Σ( )Σ( )

Page 46: ML for Scent - MIT Deep Learning 6.S191introtodeeplearning.com/slides/6S191_MIT_DeepLearning_L10.pdf · OSN: Olfactory sensory neuron ~400 ORs expressed in humans (as opposed to 3

● Test ML-driven molecular design for humans in a safe context.

● Build bedrock understanding in single-molecules before working on odor mixtures

● Build a foundational dataset for the ML on molecules community.

Collecting interest & those interested in collaborating.

Future Directions

Benjamin Sanchez-LengelingBrian LeeCarey RadebaughEmily ReifJennifer WeiAlex Wiltschko


Recommended