(282e) Mixed-Integer Programming Representations of Linear Model Decision Tree Surrogates

Conference

AIChE Annual Meeting

Year

2023

Proceeding

2023 AIChE Annual Meeting

Group

Computing and Systems Technology Division

Session

Data-driven and Surrogate Optimization in Operation I

Time

Tuesday, November 7, 2023 - 1:54pm to 2:15pm

Authors

Ammari, B. - Presenter, Carnegie Mellon University

Laird, C., NA

Stinchfield, G., Carnegie Mellon University

Johnson, E., Sandia National Laboratories

Hart, W. E., Sandia National Laboratories

Bynum, M., Sandia National Laboratories

Kim, T.

Pulsipher, J., University of Wisconsin-Madison

Significant advances in mixed-integer linear programming (MILP) and mixed-integer quadratic constrained programming (MIQCP) solvers (e.g., Gurobi) have resulted in increased interest in piecewise-linear surrogates within optimization settings. Piecewise-linear functions can be formulated as MILPs (Vielma et al., 2010), and they have been utilized extensively to approximate nonlinear, nonconvex functions that are difficult to optimize. More recently, several piecewise-linear machine learning (ML) models such as neural networks (NNs) with rectified linear unit (ReLU) activation functions (Fischetti & Jo, 2018), ensembles of decision trees (Mistry et al., 2021; MiÅ¡iÄ‡, 2020) and linear model decision trees, have also been explored within optimization frameworks. However, given the many possible MILP and MIQCP representations of these piecewise-linear models and recent advances in solver capabilities, questions remain concerning the computational performance of these different representations when embedded within a broader optimization setting. In this work, we aim to gain insight into this general question by specifically evaluating the computational performance of linear model decision trees, ReLU NNs, and gradient boosted decision trees (GBDTs) embedded within MILPs and MIQCPs.

Linear model decision trees differ from standard decision trees by returning linear regression models rather than constants at the leaf nodes. Among many of their advantages include their ability to represent discontinuous functions, and potentially approximate arbitrary functions with smaller trees and reduced error. When embedding these smaller linear model decision trees within optimization problems, their fewer leaves correspond to fewer constraints. Multiple MILP and MIQCP representations of linear model decision trees have been developed utilizing Generalized Disjunctive Programming (GDP) formulations, extensions of existing formulations for standard trees, and hybrid Big-M methods.

We present several case studies, including process family design (Stinchfield et al., 2022), flexibility analysis (Swaney & Grossmann, 1985), and global optimization that showcase the benefits of linear model decision trees. We also investigate the computational performance of different representations for linear model decision trees, including both MILP and MICQP formulations, and discuss the properties of these formulations. Finally, we compare the performance of linear model decision trees against other ML models, including GBDTs using the Optimization and Machine Learning Toolkit (OMLT) (Ceccon et al., 2022).

Acknowledgements

Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energyâ€™s National Nuclear Security Administration under contract DE-NA0003525. This paper describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the U.S. Department of Energy or the United States Government

References

Ceccon, F., Jalving, J., Haddad, J., Thebelt, A., Tsay, C., Laird, C. D., and Misener, R. (2022). â€œOMLT: Optimization & Machine Learning Toolkit.â€ Journal of Machine Learning Research, 23(349):1â€“8.

Fischetti, M. and Jo, J. (2018). â€œDeep Neural Networks and Mixed Integer Linear Optimization.â€ Constraints, 23:296â€“309.

Mistry, M., Letsios, D., Krennrich, G., Lee, R. M., and Misener, R. (2021). â€œMixed-integer Convex Nonlinear Optimization with Gradient-boosted Trees Embedded.â€ INFORMS Journal on Computing, 33:1103â€“1119.

MiÅ¡iÄ‡, V. V. (2020). â€œOptimization of Tree Ensembles.â€ Operations Research, 68:1605â€“1624.

Stinchfield, G., Biegler, L. T., Eslick, J. C., Jacobson, C., Miller, D. C., Siirola, J. D., Zamarripa A., Zhang, C., Zhang, Q., and Laird, C. D. (2022). â€œOptimization-based Approaches for Design of Chemical Process Families Using ReLU Surrogates.â€ in proceedings of Foundations of Computer Aided Process Operations, Jan. 2023.

Swaney, R. E. and Grossmann, I. E. (1985). â€œAn Index for Operational Flexibility in Chemical Process Design.â€ AIChE Journal, 31:621â€“630

Vielma, J. P., Ahmed, S., and Nemhauser, G. (2010). â€œMixed-integer Models for Nonseparable Piecewise-linear Optimization: Unifying Framework and Extensions.â€ Operations Research, 58:303â€“315.

Topics

Computing and Systems Engineering

Other Sites & Tools

Technical Groups

Technical

Professional/Personal Growth

Societal Needs

Leadership

6th Middle East Process Engineering Conference and Exhibition

Quantum Computing and Artificial Intelligence Applications Workshop

Upcoming Conferences & Events

2024 Pacific Northwest Student Regional Conference

2024 Western Student Regional Conference

CCPS Middle East Regional Meeting

Hydrogen Fueling Station Safety

Streamlining Permit-to-Work Processes With a Digital Solution

6th Middle East Process Engineering Conference and Exhibition

Quantum Computing and Artificial Intelligence Applications Workshop

2024 Offshore Technology Conference

Fourth AIChE Middle East Regional Chem-E-Car Competition

CEP: April 2024

CEP: March 2024

Explore Areas of Advancement:

Learning Center:

Want to be an Entrepreneur? Personal Stories From Three Successful Entrepreneurs Who Have Traveled This Path.

(282e) Mixed-Integer Programming Representations of Linear Model Decision Tree Surrogates

AIChE Annual Meeting

2023

2023 AIChE Annual Meeting

Computing and Systems Technology Division

Data-driven and Surrogate Optimization in Operation I

Tuesday, November 7, 2023 - 1:54pm to 2:15pm

Authors

Topics

More Conference Links

Visit Orlando

Universal Studios Offer

Cancellation Policy

Code of Conduct

Beware of Hotel and Attendee-list Scams