A Rule-Based Library for the Verification and Modification of Synthetic DNA Sequences

Conference

Synthetic Biology Engineering Evolution Design SEED

Year

2015

Proceeding

2015 Synthetic Biology: Engineering, Evolution & Design (SEED)

Group

Thursday, June 11, 2015 - 5:30pm to 7:00pm

Authors

Oberortner, E. - Presenter, Lawrence Berkeley National Laboratory

Deutsch, S., DOE Joint Genome Institute

Cheng, J. F., Lawrence Berkeley National Laboratory

Hillson, N. J., DOE Joint BioEnergy Institute

Many synthetic biology research groups design complex DNA sequences in silico according to biological discoveries and rules, but order the physical sequence from DNA synthesis vendors. Before the physical synthesis process starts, at latest, the designed sequences must be verified against rules of DNA synthesis methods and technologies. If a sequence violates a rule, then corrective actions are performed manually that depend on the type of rule and the designers' knowledge. Moreover, manual sequence modifications are hard to reenact, error-prone, time-consuming, and circumvent fully automated workflows.

Various algorithms have been developed to verify sequences against rules in silico. Biological advances have shown that rules can be mainly categorized into two types: repeats and GC content. Both types can occur in the entire sequence (i.e. globally) or in specific regions (i.e. locally). Direct repeats denote k-mers (e.g. polymers, dimers, trimers) and inverted repeats indicate potential undesired secondary structures (e.g. hairpins). Here, we demonstrate the Sequence Polishing Library (SPL) and its integration into an automated workflow at the DOE Joint Genome Institute. Currently, the SPL imports sequence information from GenBank, FASTA, FASTQ, or SBOL file formats. Besides finding only exact matches of sequence repeats, the SPL can find repeats to a configurable degree of similarity. Therefore, SPL programmatically integrates the BBMap bioinformatics library (http://sourceforge.net/projects/bbmap).

For the specification of rules, we develop a yet simple but expressive language, which also supports the specification of corrective actions. The SPL differentiates between human, recommended, and automated corrective actions. Per default, human corrective actions must be performed since which action to perform depends on where and what type of violation occurs. In order to modify sequences in an automated fashion, information is needed about the sequences' structure. For example, if a coding sequence contains repeats, then organism-specific codon tables can be consulted to modify the DNA sequence. If a promoter violates a rule, then the SPL can recommend alternative promoters to the designer. Such structural information can be specified, for example, using the GenBank or SBOL format.

The SPL can be invoked programmatically via an Application Programming Interface (API) and utilized manually via a web-based User Interface (UI). Both interfaces provide expressive feedback about rule violations and corrective actions. Therefore, we believe that synthetic biology researchers as well as DNA synthesis vendors can benefit from utilizing the SPL, contributing to the automation of internal and cross-organizational workflows.

Topics

Synthetic Biology

Other Sites & Tools

Technical Groups

Technical

Professional/Personal Growth

Societal Needs

Leadership

2024 mRNA Technology Conference

5th Engineering Cosmetics and Consumer Products Conference

Upcoming Conferences & Events

2024 Eckhardt Northeast Student Regional Conference

2024 mRNA Technology Conference

5th Engineering Cosmetics and Consumer Products Conference

2024 DIERS Virtual Spring Meeting

2024 Pacific Northwest Student Regional Conference

2024 Western Student Regional Conference

CCPS Middle East Regional Meeting

Hydrogen Fueling Station Safety

Streamlining Permit-to-Work Processes With a Digital Solution

CEP: April 2024

CEP: March 2024

Explore Areas of Advancement:

Learning Center:

Want to be an Entrepreneur? Personal Stories From Three Successful Entrepreneurs Who Have Traveled This Path.

A Rule-Based Library for the Verification and Modification of Synthetic DNA Sequences

Synthetic Biology Engineering Evolution Design SEED

2015

2015 Synthetic Biology: Engineering, Evolution & Design (SEED)

Poster Session

Poster Session A

Thursday, June 11, 2015 - 5:30pm to 7:00pm

Authors

Topics

More Conference Links

Cancelation Policy

Code of Conduct

Beware of Hotel and Attendee-list Scams