+ All Categories
Home > Documents > O M L TREE-BASED MODELSwp.doc.ic.ac.uk/rmisener/.../sites/...stem_poster.pdf · OPTIMISING OVER...

O M L TREE-BASED MODELSwp.doc.ic.ac.uk/rmisener/.../sites/...stem_poster.pdf · OPTIMISING OVER...

Date post: 26-Sep-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
1
O PTIMISING OVER M ACHINE L EARNING T REE -B ASED M ODELS Miten Mistry 1 , Dimitrios Letsios 1 , Gerhard Krennrich 2 , Robert M. Lee 2 , and Ruth Misener 1 1 Imperial College London, UK 2 BASF SE, Germany O PTIMISING OVER M ACHINE L EARNING T REE -B ASED M ODELS Miten Mistry 1 , Dimitrios Letsios 1 , Gerhard Krennrich 2 , Robert M. Lee 2 , and Ruth Misener 1 1 Imperial College London, UK 2 BASF SE, Germany STEM for Britain 2019 [email protected] Mathematical Modelling: Exact Data Analysis Data can involve many variables, contain measurement errors, or may be very large. Deriving an exact mathematical model for the data is a costly task requiring expertise and time. These mathematical models are derived from our understanding of the physical world. Exact mathemat- ical models are readily incorporated into decision-making problems. Acquire Data Costly Analysis f ( x ) Exact Mathematical Model Machine Learning: Approximate Data Analysis Machine learning approximates how data behaves without needing con- textual knowledge. Machine learning is cheaper than exact analysis. While machine learning models are effective, they tend to be complex. We question how to integrate and manage complex machine learning models into larger decision-making problems? Acquire Data Machine Learning ˆ f ( x ) Approximate Predictive Model Decision Trees: Explainable Machine Learning Decision Tree Is x 1 3? Is x 2 4? 2 0.4 no yes Is x 2 3? -3.1 7 no yes no yes Input: x 1 = 4, x 2 = 2. Output: 0.4. Query a sequence of yes/no ques- tions to find prediction. Sequence of yes/no responses explain a de- cision tree’s prediction. Tree-Based Model Contains several decision trees that collaboratively outperform a single decision tree. Prediction sums all decision tree responses. Applications Catalysis Catalysts speed up chemical reactions and are es- sential for energy efficiency at BASF. BASF finds tree-based models effective for modelling catalyst be- haviour. Developing the best-performing catalysts re- quires optimising over the BASF tree-based models. Concrete Mixture Design Concrete is a fundamental building material that ob- tains different properties dependent on ingredient proportions. Tree-based models can predict con- crete properties and offer explanations for the predic- tion. We can repurpose strength predicting tree-based models for an optimisation context. From Prediction to Optimal Decision-Making Goal: Integrate and manage tree-based models in larger decision-making problems. Acquire data Learn tree-based model Optimise over tree-based model Difficulty in Optimising Tree-Based Models Optimising over tree-based models is difficult because they lack smooth- ness. Our research considers how to leverage the tree-based structure inherent to these machine learnt functions. Tree-based models move in steps. This makes optimisation challenging. Where a Tree-Based Model Makes Sense A tree-based model is trustworthy in regions close to data and acquir- ing more data may be expensive. We represent trustworthiness with a penalty that makes regions further from data less optimal. : High penalty : Low penalty : Data Decision-Making Optimisation Problem: OPTIMISE Tree-based model + Penalty Penalise regions further from data Guaranteeing Decision Quality We develop an algorithm that guarantees the best solution of the decision-making optimisation problem. The algorithm automatically analyses tree-based model structure to dynamically split the problem into easier-to-solve subproblems. Key algorithm elements: Leverage efficient mathematical solvers Specialised approximation for tree-based models Removing non-optimal regions by dynamically dividing do- main. 0 0.5 1 -800 -400 0 Time/hours Bound Closeness to the green dashed line yields a better guarantee. Our ap- proach (red line) outperforms off-the- shelf approaches (blue line). Getting Good Decisions Quickly Particle swarm optimisation: Generate particles near low penalty regions and focus search in a collab- orative manner. Decomposing tree-based models: Approximate tree-based model with a decomposition and use a mathe- matical solver. Iteratively generate multiple candidate solutions. Future Work Design methods for handling additional machine learnt models in a decision-making context. Develop freely available decision-making software that integrates and manages machine learnt functions. Acknowledgements The support of BASF SE, Lugwigshafen am Rhein, the EPSRC Centre for Doctoral Training in High Performance Embedded and Distributed Systems to M.M. (HiPEDS, EP/L016796/1) and an EPSRC Research Fellowship to R.M. (EP/P016871/1) is gratefully acknowledged. References [1] M. Mistry, D. Letsios, R. Misener, G. Krennrich, and R. M. Lee. Mixed-Integer Convex Non- linear Optimization with Gradient-Boosted Trees Embedded. ArXiv, 2018. arXiv:1803.00952.
Transcript
Page 1: O M L TREE-BASED MODELSwp.doc.ic.ac.uk/rmisener/.../sites/...stem_poster.pdf · OPTIMISING OVER MACHINE LEARNING TREE-BASED MODELS Miten Mistry1, Dimitrios Letsios1, Gerhard Krennrich2,

OPTIMISING OVER MACHINE LEARNING TREE-BASED MODELSMiten Mistry1, Dimitrios Letsios1, Gerhard Krennrich2, Robert M. Lee2, and Ruth Misener1

1Imperial College London, UK 2BASF SE, Germany

OPTIMISING OVER MACHINE LEARNING TREE-BASED MODELSMiten Mistry1, Dimitrios Letsios1, Gerhard Krennrich2, Robert M. Lee2, and Ruth Misener1

1Imperial College London, UK 2BASF SE, Germany

STEM for Britain [email protected]

Mathematical Modelling: Exact Data Analysis

Data can involve many variables, contain measurement errors, or maybe very large. Deriving an exact mathematical model for the data is acostly task requiring expertise and time. These mathematical models arederived from our understanding of the physical world. Exact mathemat-ical models are readily incorporated into decision-making problems.

AcquireData

CostlyAnalysis f (x)

ExactMathematical

Model

Machine Learning: Approximate Data Analysis

Machine learning approximates how data behaves without needing con-textual knowledge. Machine learning is cheaper than exact analysis.While machine learning models are effective, they tend to be complex.We question how to integrate and manage complex machine learningmodels into larger decision-making problems?

AcquireData

MachineLearning f̂ (x)

ApproximatePredictive

Model

Decision Trees: Explainable Machine Learning

Decision Tree

Is x1 ≥ 3?

Is x2 ≥ 4?

20.4

no yes

Is x2 ≥ 3?

−3.17

no yes

no yes

Input: x1 = 4, x2 = 2.

Output: 0.4.

Query a sequence of yes/no ques-tions to find prediction. Sequenceof yes/no responses explain a de-cision tree’s prediction.

Tree-Based Model

Contains several decision treesthat collaboratively outperform asingle decision tree. Predictionsums all decision tree responses.

Applications

CatalysisCatalysts speed up chemical reactions and are es-sential for energy efficiency at BASF. BASF findstree-based models effective for modelling catalyst be-haviour. Developing the best-performing catalysts re-quires optimising over the BASF tree-based models.

Concrete Mixture DesignConcrete is a fundamental building material that ob-tains different properties dependent on ingredientproportions. Tree-based models can predict con-crete properties and offer explanations for the predic-tion. We can repurpose strength predicting tree-basedmodels for an optimisation context.

From Prediction to Optimal Decision-Making

Goal: Integrate and manage tree-based models in larger decision-making problems.

Acquire data Learn tree-based model Optimise over tree-based model

Difficulty in Optimising Tree-Based ModelsOptimising over tree-based models is difficult because they lack smooth-ness. Our research considers how to leverage the tree-based structureinherent to these machine learnt functions.

Tree-based models move in steps. This makes optimisation challenging.

Where a Tree-Based Model Makes SenseA tree-based model is trustworthy in regions close to data and acquir-ing more data may be expensive. We represent trustworthiness with apenalty that makes regions further from data less optimal.

: High penalty

: Low penalty

: Data

Decision-Making Optimisation Problem:

OPTIMISE Tree-based model + Penalty

Penalise regionsfurther from data

Guaranteeing Decision Quality

We develop an algorithm that guarantees the best solution of thedecision-making optimisation problem. The algorithm automaticallyanalyses tree-based model structure to dynamically split the probleminto easier-to-solve subproblems.

Key algorithm elements:

• Leverage efficient mathematicalsolvers

• Specialised approximation fortree-based models

• Removing non-optimal regionsby dynamically dividing do-main.

0 0.5 1−800

−400

0

Time/hours

Boun

d

Closeness to the green dashed lineyields a better guarantee. Our ap-proach (red line) outperforms off-the-shelf approaches (blue line).

Getting Good Decisions Quickly

Particle swarm optimisation:Generate particles near low penalty regions and focus search in a collab-orative manner.

Decomposing tree-based models:Approximate tree-based model with a decomposition and use a mathe-matical solver. Iteratively generate multiple candidate solutions.

Future Work

Design methods for handling additional machine learnt models in adecision-making context.Develop freely available decision-making software that integrates andmanages machine learnt functions.

Acknowledgements

The support of BASF SE, Lugwigshafen am Rhein, the EPSRC Centre for Doctoral Training inHigh Performance Embedded and Distributed Systems to M.M. (HiPEDS, EP/L016796/1) andan EPSRC Research Fellowship to R.M. (EP/P016871/1) is gratefully acknowledged.

References

[1] M. Mistry, D. Letsios, R. Misener, G. Krennrich, and R. M. Lee. Mixed-Integer Convex Non-linear Optimization with Gradient-Boosted Trees Embedded. ArXiv, 2018. arXiv:1803.00952.

Recommended