(282e) Mixed-Integer Programming Representations of Linear Model Decision Tree Surrogates
AIChE Annual Meeting
2023
2023 AIChE Annual Meeting
Computing and Systems Technology Division
Data-driven and Surrogate Optimization in Operation I
Tuesday, November 7, 2023 - 1:54pm to 2:15pm
Linear model decision trees differ from standard decision trees by returning linear regression models rather than constants at the leaf nodes. Among many of their advantages include their ability to represent discontinuous functions, and potentially approximate arbitrary functions with smaller trees and reduced error. When embedding these smaller linear model decision trees within optimization problems, their fewer leaves correspond to fewer constraints. Multiple MILP and MIQCP representations of linear model decision trees have been developed utilizing Generalized Disjunctive Programming (GDP) formulations, extensions of existing formulations for standard trees, and hybrid Big-M methods.
We present several case studies, including process family design (Stinchfield et al., 2022), flexibility analysis (Swaney & Grossmann, 1985), and global optimization that showcase the benefits of linear model decision trees. We also investigate the computational performance of different representations for linear model decision trees, including both MILP and MICQP formulations, and discuss the properties of these formulations. Finally, we compare the performance of linear model decision trees against other ML models, including GBDTs using the Optimization and Machine Learning Toolkit (OMLT) (Ceccon et al., 2022).
Acknowledgements
Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energyâs National Nuclear Security Administration under contract DE-NA0003525. This paper describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the U.S. Department of Energy or the United States Government
References
Ceccon, F., Jalving, J., Haddad, J., Thebelt, A., Tsay, C., Laird, C. D., and Misener, R. (2022). âOMLT: Optimization & Machine Learning Toolkit.â Journal of Machine Learning Research, 23(349):1â8.
Fischetti, M. and Jo, J. (2018). âDeep Neural Networks and Mixed Integer Linear Optimization.â Constraints, 23:296â309.
Mistry, M., Letsios, D., Krennrich, G., Lee, R. M., and Misener, R. (2021). âMixed-integer Convex Nonlinear Optimization with Gradient-boosted Trees Embedded.â INFORMS Journal on Computing, 33:1103â1119.
MiÅ¡iÄ, V. V. (2020). âOptimization of Tree Ensembles.â Operations Research, 68:1605â1624.
Stinchfield, G., Biegler, L. T., Eslick, J. C., Jacobson, C., Miller, D. C., Siirola, J. D., Zamarripa A., Zhang, C., Zhang, Q., and Laird, C. D. (2022). âOptimization-based Approaches for Design of Chemical Process Families Using ReLU Surrogates.â in proceedings of Foundations of Computer Aided Process Operations, Jan. 2023.
Swaney, R. E. and Grossmann, I. E. (1985). âAn Index for Operational Flexibility in Chemical Process Design.â AIChE Journal, 31:621â630
Vielma, J. P., Ahmed, S., and Nemhauser, G. (2010). âMixed-integer Models for Nonseparable Piecewise-linear Optimization: Unifying Framework and Extensions.â Operations Research, 58:303â315.