(18f) Performance Limits and Optimization for T-Rflp Sensors | AIChE

(18f) Performance Limits and Optimization for T-Rflp Sensors

Authors 

Kantor, J. - Presenter, University of Notre Dame


Terminal restriction fragment polymorphism (t-rflp) of rDNA for the16S small ribosomal subunit has become a widely used method for analyzing microbial communities, with the potential for a new class of low-cost sensors of bacterial organisms for medical and environmental applications. Large databases of fragment lengths are now available from experimental data in addition to in-silico digestions of over 500,000 bacterial organisms and up 90 restriction enzymes. Bacterial identification, however, is limited by the number of distinct fragment lengths produced by digestion with typical restriction enzymes, and the ability to experimentally resolve fragment lengths.

In this paper we demonstrate limits to performance of sensors based on t-rflp. The analysis uses Shannon and Renyi information entropy as a priori and a posteriori measures of bacterial probability density function. The conditional ?information gain' is then a metric measuring the effectiveness of a sensor based on either single or multiple restriction enzymes. We demonstrate this principle by computing information gain and the number of distinguishable bacterial ?equivalence classes' of typical restriction enzymes used for the phylogenetic analysis of bacterial colonies in the human gut.

Typical t-rflp instrumental methods combine digests from multiple restriction enzymes. For this case, we formulate and solve for the maximum entropy aposteriori probability distribution for bacterial organisms. We further show how to optimize information gain for two characteristic cases. The first is optimal combination of multiple digests to form a single t-rflp chromatogram which leads to a large scale convex optimization problem. The second case of multiple enzymes and multiple length chromatograms leads to an intractably large optimization problem, but for which we can propose performance metrics.

These ideas are demonstrated using literature data for t-rflp in medical and ecological applications.