O M L TREE-BASED MODELSwp.doc.ic.ac.uk/rmisener/.../sites/...stem_poster.pdf · OPTIMISING OVER...

Post on 26-Sep-2020

0 views 0 download

transcript

OPTIMISING OVER MACHINE LEARNING TREE-BASED MODELSMiten Mistry1, Dimitrios Letsios1, Gerhard Krennrich2, Robert M. Lee2, and Ruth Misener1

1Imperial College London, UK 2BASF SE, Germany

OPTIMISING OVER MACHINE LEARNING TREE-BASED MODELSMiten Mistry1, Dimitrios Letsios1, Gerhard Krennrich2, Robert M. Lee2, and Ruth Misener1

1Imperial College London, UK 2BASF SE, Germany

STEM for Britain 2019miten.mistry11@imperial.ac.uk

Mathematical Modelling: Exact Data Analysis

Data can involve many variables, contain measurement errors, or maybe very large. Deriving an exact mathematical model for the data is acostly task requiring expertise and time. These mathematical models arederived from our understanding of the physical world. Exact mathemat-ical models are readily incorporated into decision-making problems.

AcquireData

CostlyAnalysis f (x)

ExactMathematical

Model

Machine Learning: Approximate Data Analysis

Machine learning approximates how data behaves without needing con-textual knowledge. Machine learning is cheaper than exact analysis.While machine learning models are effective, they tend to be complex.We question how to integrate and manage complex machine learningmodels into larger decision-making problems?

AcquireData

MachineLearning f̂ (x)

ApproximatePredictive

Model

Decision Trees: Explainable Machine Learning

Decision Tree

Is x1 ≥ 3?

Is x2 ≥ 4?

20.4

no yes

Is x2 ≥ 3?

−3.17

no yes

no yes

Input: x1 = 4, x2 = 2.

Output: 0.4.

Query a sequence of yes/no ques-tions to find prediction. Sequenceof yes/no responses explain a de-cision tree’s prediction.

Tree-Based Model

Contains several decision treesthat collaboratively outperform asingle decision tree. Predictionsums all decision tree responses.

Applications

CatalysisCatalysts speed up chemical reactions and are es-sential for energy efficiency at BASF. BASF findstree-based models effective for modelling catalyst be-haviour. Developing the best-performing catalysts re-quires optimising over the BASF tree-based models.

Concrete Mixture DesignConcrete is a fundamental building material that ob-tains different properties dependent on ingredientproportions. Tree-based models can predict con-crete properties and offer explanations for the predic-tion. We can repurpose strength predicting tree-basedmodels for an optimisation context.

From Prediction to Optimal Decision-Making

Goal: Integrate and manage tree-based models in larger decision-making problems.

Acquire data Learn tree-based model Optimise over tree-based model

Difficulty in Optimising Tree-Based ModelsOptimising over tree-based models is difficult because they lack smooth-ness. Our research considers how to leverage the tree-based structureinherent to these machine learnt functions.

Tree-based models move in steps. This makes optimisation challenging.

Where a Tree-Based Model Makes SenseA tree-based model is trustworthy in regions close to data and acquir-ing more data may be expensive. We represent trustworthiness with apenalty that makes regions further from data less optimal.

: High penalty

: Low penalty

: Data

Decision-Making Optimisation Problem:

OPTIMISE Tree-based model + Penalty

Penalise regionsfurther from data

Guaranteeing Decision Quality

We develop an algorithm that guarantees the best solution of thedecision-making optimisation problem. The algorithm automaticallyanalyses tree-based model structure to dynamically split the probleminto easier-to-solve subproblems.

Key algorithm elements:

• Leverage efficient mathematicalsolvers

• Specialised approximation fortree-based models

• Removing non-optimal regionsby dynamically dividing do-main.

0 0.5 1−800

−400

0

Time/hours

Boun

d

Closeness to the green dashed lineyields a better guarantee. Our ap-proach (red line) outperforms off-the-shelf approaches (blue line).

Getting Good Decisions Quickly

Particle swarm optimisation:Generate particles near low penalty regions and focus search in a collab-orative manner.

Decomposing tree-based models:Approximate tree-based model with a decomposition and use a mathe-matical solver. Iteratively generate multiple candidate solutions.

Future Work

Design methods for handling additional machine learnt models in adecision-making context.Develop freely available decision-making software that integrates andmanages machine learnt functions.

Acknowledgements

The support of BASF SE, Lugwigshafen am Rhein, the EPSRC Centre for Doctoral Training inHigh Performance Embedded and Distributed Systems to M.M. (HiPEDS, EP/L016796/1) andan EPSRC Research Fellowship to R.M. (EP/P016871/1) is gratefully acknowledged.

References

[1] M. Mistry, D. Letsios, R. Misener, G. Krennrich, and R. M. Lee. Mixed-Integer Convex Non-linear Optimization with Gradient-Boosted Trees Embedded. ArXiv, 2018. arXiv:1803.00952.