Module 5.2022
Module 5.2022
Associated papers:
• Chemical biology.
What is Moore’s Law?
"Moore's law" is the observation that, over the history of
computing hardware, the number of transistors in a dense
integrated circuit doubles approximately every two years.
The observation is named after Gordon E. Moore, co-
founder of the Intel Corporation, who first described the
trend in a 1965 paper[1][2][2][3] and formulated its current
statement in 1975. His prediction has proven to be accurate,
in part because the law now is used in the semiconductor
industry to guide long-term planning and to set targets for
research and development.[4]
http://www.synthesis.cc/2013/04/updated-
dna-cost-and-productivity-curves-plus-a-few-
more-thoughts-on-moores-law.html
Maybe a more intuitive way to look at it
http://www.synthesis.cc/cgi-bin/mt/mt-
search.cgi?blog_id=1&tag=Carlson%20Curves
And here’s what that might mean to you ….
Synthetic Biology
N-terminal domain: Gives the characteristic concave shape for the enzyme.
Involved in promoter recognition and DNA melting during transcription initiation.
Active site: Located in the deep pocket bound by fingers, thumb, palm and N-terminal domain.
Like most everything else in biology, the job of T7 RNA
polymerase is to help its gene / genome (the bacteriophage)
survive. Incidentally, this means that it actually *doesn’t*
want to talk to the cell, which it is in the process of blowing
up. Here’s what the genome of the T7 bacteriophage looks
like:
Promoters
Proteins
This set of promoters actually allows the T7 bacteriophage to do
an ordered ‘dance’ of transcription that results in just the right
genes being made and translated at just the right times in the
viral life cycle … without any interference by its prey, E. coli:
Promoter specificity
Tyr 639
BASEN BASE(N+1)
H O His 784
OH O
O O H N
N
O_
5' RNA O O
OH Amino acids
transcript O
2+
Pa O
O
incoming
ribonucleotide
R425, G542 ,Y639, H784
Mg O P
O 2+ O
Mg O Pyrophosphate
O leaving group
Asp 537 O O
P
O O
O_
Asp 812
No! Because then the ribosome would not know what to do with
the mRNAs that were made (although an entirely orthogonal first
*and* second genetic code based on modified nucleotides would be
totally cool!).
Parts aren’t’ just what nature gives us, they can be engineered: Letter
pubs.acs.org/synthbio
Meyer et al. (2015)
Directed Evolution of a Panel of Orthogonal T7 RNA Polymerase
Variants for in Vivo or in Vitro Synthetic Circuitry
Adam J. Meyer,† Jared W. Ellefson,† and Andrew D. Ellington*,†,‡,§
†
Institute for Cellular and Molecular Biology, ‡Department of Chemistry and Biochemistry, and §Center for Systems & Synthetic
Biology, University of Texas at Austin, Austin, Texas 78712, United States
*
S Supporting Information
T
What are they (we) trying to do? Create orthogonal
he field of synthetic biology relies on designed, synthetic
circuits made from genetic parts that function robustly
T7 RNAP specifically recognizes its promoter primarily
through the specificity loop (residues 739 to 770), especially by
polymerase:promoter combinations (new enzymes:new DNA
and orthogonally to the cell’s operating system. All too often,
synthetic parts are evolved, engineered, or mined to perform a
interactions made between residues R746, N748, R756, and
Q758 with the major groove from −7 to −11.18 Mutations to
by overlapdressing
PCR, agarose from oil and
gel-purified, water.
based purification. Full-length T7 RNAP genes were assembled
digested with restriction
endonucleases, and ligated into an expression plasmid for
further rounds of selection. The library began to converge on
The polymerases
• sequences
active after four rounds of in cellsand
selection, in after
turn drive
seven
Figure 1. Compartmentalized partnered replication selection scheme.
(Top) E. coli cells containing the Taq DNA polymerase gene under the production of Taq polymerase, a
rounds the library was dominated by a single variant (termed
CTGA-R7-1, Supplementary Table 2) that had the sequence
the control of a synthetic T7 promoter variant are transformed with a
saturation (or error-prone) mutagenesis library of T7 RNAP mutants. thermostable
L747I, N748T, R756T, and Q758K DNA polymerase,
(Supplementary under
Figure 1).
In order to further ascertain the functionality of CTGA-R7-1,
Variants capable of recognizing the promoter produce Taq DNA
polymerase protein. (Middle) Whole E. coli cells are compartmental- the control
we transformed an expressionofconstruct
a novel for theT7polymerase
RNA into
ized in a water-in-oil emulsion. The aqueous droplets also contain an E. colipolymerase promoter.
strain that also carried a GFP gene coupled to the
primers, dNTPs, and Taq DNA polymerase buffer. (Bottom) PCTGA promoter (Supplementary Figure 1). CTGA-R7-1
Emulsions are thermal cycled, leading to E. coli cell lysis and in vitro produced about 1% as much GFP from PCTGA as the wild-
PCR amplification of the T7 RNAP genes that led to the production of type T7 RNAP produced from PT7. This weak activity was in
Taq DNA polymerase during the in vivo expression step. PCR product line with the generally low efficiencies seen by others for
is recovered and prepared for the next round of selection.
specificity variants.24 We hypothesized, however, that further
mutations in the regions flanking the promoter specificity
ACS Synthetic Biology Letter
Note that the two primers are *different.* You need one for each strand. You’re going to be
designing your own primers soon. The ‘in between’ here would have been the T7 RNA
polymerase gene (or a portion thereof that could been inserted into a partial T7 RNA
polymerase gene
further rounds of selection. The library began to converge on
active sequences after four rounds of selection, and after seven
Figure 1. Compartmentalized partnered replication selection scheme. rounds the library was dominated by a single variant (termed
• The amplified DNA can be re-cloned into cells, the cells re-
(Top) E. coli cells containing the Taq DNA polymerase gene under
the control of a synthetic T7 promoter variant are transformed with a
CTGA-R7-1, Supplementary Table 2) that had the sequence
L747I, N748T, R756T, and Q758K (Supplementary Figure 1).
saturation (or error-prone) mutagenesis library of T7 RNAP mutants.
emulsified … and the competition between ‘good’ DNAs (with
Variants capable of recognizing the promoter produce Taq DNA
polymerase protein. (Middle) Whole E. coli cells are compartmental-
In order to further ascertain the functionality of CTGA-R7-1,
we transformed an expression construct for the polymerase into
good polymerase genes) and ‘bad’ DNAs (with less good
ized in a water-in-oil emulsion. The aqueous droplets also contain
primers, dNTPs, and Taq DNA polymerase buffer. (Bottom)
an E. coli strain that also carried a GFP gene coupled to the
PCTGA promoter (Supplementary Figure 1). CTGA-R7-1
polymerase genes) can continue.
Emulsions are thermal cycled, leading to E. coli cell lysis and in vitro
PCR amplification of the T7 RNAP genes that led to the production of
produced about 1% as much GFP from PCTGA as the wild-
type T7 RNAP produced from PT7. This weak activity was in
Taq DNA polymerase during the in vivo expression step. PCR product line with the generally low efficiencies seen by others for
is recovered and prepared for the next round of selection.
specificity variants.24 We hypothesized, however, that further
• After multiple cycles of selection and amplification, you can find
of T7 RNAP we generated a strain of E. coli in which the
mutations in the regions flanking the promoter specificity
determinants would provide substantive improvements in
individual protein variants (out of libraries of millions) that are
production of Taq DNAP was dependent on the use of a
synthetic T7 promoter. Individual cells were compartmental-
activity. We therefore subjected a larger region encompassing
most of the “fingers” domain18 (amino acid residues 663−793)
really good at the new promoters:
ized in a water-in-oil emulsion with 5′-biotinylated primers for
PCR amplification of the T7 RNAP specificity determining
to error-prone PCR (at a rate of two mutations per kb) and to
CPR optimization. After six further rounds of selection (with
Promoter Operator
Polymerase Repressor
Shine-Delgarno
sequence
lay
and
ogy
the What constitutes an
ade
ast- expression unit, in this
tep
ign diagram? What is
to
any minimally needed in order
For
set to express a protein?
na
oin
the
wly
an
tly,
in
of
heir
to
ple
zes
n a
nce
This modularity has in turn allowed researchers to construct useful
(and usable!) ‘toolkits’ of parts, that can be further built into
circuits.
ACS Synthetic Biology Research Article
Supplementary Figure S6
The different parts for any of these cloning methods can
come pre-made (ala the Golden Gate kit), or you can add
in your own parts via PCR with appropriate
oligonucleotides.
Your gene
and
* *
Examine the previous slide.
But keep in mind things are dynamic, always changing in the cell!
The circuit *maintains* itself, but if perturbed can reach a new set
state (for example … UV light)
a b
PflMI -35 -10 SD
A AGGAGGAAAAAAATG
Ptrc-2 CCATCGAATGGCTGAAATGAGCTGTTGACAATTAATCATCCGGCTCGTATAATGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGA
B AGGAATTTAAATG
Olac
C AGGAAACAGACCATG
SphI -35 -10 SD
D AGGAAACCGGTTCGATG
PL-s1con GCATGCACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAATACCACTGGCGGTtATAaTGAGCACATCAGCAGG//GTATGCAAAGGA
E AGGAAACCGGTTATG
OL3 OL2 OL1
F AGGACGGTTCGATG
SphI -35 -10 SD G AGGAAAGGCCTCGATG
PLtet0-1 GCATGCTCCCTATCAGTGATAGAGATTGACATCCCTATCAGTGATAGAGATACTGAGCACATCAGCAGGACGCACTGACCAGGA
H AGGACGGCCGGATG
Otet2 Otet2
What might cause the monostable states (see also next slide)?
a
Let’s focus on pTAK117, from 1 IPTG 42°C pTAK117
P1? 1 pTAK102
(control)
!
*%
0 '$
1 pIKE108 &*
Adjacent, look at the
(control)
&'
0 ,
performance of pTAK117: it c
0 5 10
Hours
15 20
2
0
toggles ‘on’ and stays ‘on’ in IPTG
GFP expression
1,600
8
,
the presence of IPTG; it 800
&'
No IPTG
toggles ‘off’ and stays ‘off’ in
46
0
'1
growth temperature is 37 !"#$%& ' !"#$%&'()'*$% $+ ,*&'),*-*'./ 01" 2(". &1)3*%2 *%3*4)'"& 5"(*$3& $+ 41"#*4)- $(
'1"(#)- *%364'*$%/ 01" -*%"& *% ( )%3 )7 81*41 )(" )55($9*#)'*$%& $+ '1" &8*'41*%2
-$
1
degrees C.). Why (see 3.%)#*4&7 )(" *%4-63"3 +$( 4-)(*'./ (7 50:; '$22-" 5-)&#*3& )%3 4$%'($-&/ )7 5<;= '$22-"
5-)&#*3& )%3 4$%'($-&/ *7 !"#$%&'()'*$% $+ '1" -$%2>'"(# &'),*-*'. $+ '1" &"5)()'"
(
("
worksheet)? "95("&&*$% &')'"& *% 50:;??@/ 3
Red dots = pTAK117;
Blue dots = pTAK102
What is going on
Here/?
What is FACS/?
How is the
information
obtained by
FACS different?
How does
this differ
from Type IV?
P1 Ptrc-2 P1 Ptrc-2
RBS1 rbs E RBS1 rbs E
GFPmut3
3/ R1
lacI
GFPmut3
*
Design a circuit to express GFP only when there was *no*
IPTG in the media.
How does RecA
impact cI?
Based on
this figure,
what do you
suppose
TraA does?
What does the thermal Why does the toggle decline in the
shift do? absence of inducer (*)? Isn’t that
counter to what it’s supposed to do/?
Why would such a system be useful?
“The partial decrease in fluorescence observed 12 h
after removal of IPTG reflected the relaxation from a
state where LacR is completely inactive to a stable state
where cI is the dominant repressor, but LacR still has
some basal activity. The stability of distinct expression
states was confirmed in a separate control experiment
where stable expression was observed for up to 50 h
(correpsonding to 50-60 generations) after the removal
of the inducing factor.”
What will this circuit do?
!"##"$% #& '(#)$"
a Repressilator Reporter -, !<2 "5 =8
9:=:3@8> 18
PLlac01
K6435@L8A2
ampR
tetR-lite #D8 5@;8
PLtet01 M@C2 .2 #8;
kanR :B874== @39
TetR TetR
pSC101 gfp-aav ?Q, ;@3658
λPR
origin 5@;82 #D8 4
λ cI LacI GFP =8B8=> :E S
8VD@R@5 :>9
lacI-lite
ColE1 A8587;@38A
λ cI-lite 743C8 :E N8
PLtet01 @3587B4=>I @
SMT =8B8=>
:38 43:5D8
434=H>@> :E
4B874C8 D4=
b =:3C87 5D43
9:3A@5@:3>
β
steady state
=@G8 5: A@>5@3C6@>D >69D >5:9D4>5@9 8EE895> E7:; N:>>@R=8 @357@3>@94==H
9:;N=8V AH34;@9> O>69D 4> @3587;@558398 :7 9D4:5@9 R8D4B@:67P2
OnM675D87
a cell-by-cell basis,5:it@A835@EH
>56A@8> 478 388A8A oscillates. This is5D8
43A 9D4749587@`8 actually
>:6798> what in
:E J695645@:3> @3 5D8 78N78>>@=45:7 43A :5D87 A8>@C38A 3851:7G>2 ^3
EEN475@96=47I
terms is=:3C87
called a ring oscillator.
8VN87@;835> N87E:7;8A 63A87 Why is EE terminology
9D8;:>545@9 9:3F
good for>D:6=A
A@5@:3> synthetic biology?
834R=8 ;:78 9:;N=858Why might
>545@>5@94= it be limiting?
9D4749587@`45@:3 :E
0 1,000
Time (min)
60 140 250 300 390 450 550 600
a
0 1000
min)
b
2%08 &,3&,$$);0, 80
%2, &)-2% 3&"/"%,&
60
,+'(,+ ;8 %2,
30.$/)+ C&)-2%E 40
.$/)+$* %&.#$(&)3: 20
"/ %2, ,- #.!& //01
,3&,$$)0.%"& /"+,0 0
0 100 200 300 400 500 600
.+8 $%.%, )$ $%.;0, Time (min)
.&),$ ;,%6,,# %2,
Represent the function of the repressilator …
via FACS. What would this look like? Why?
yemGFP M
0 60 120 180 T
pLlacO-1 Time (min)
y
Figure 3 | An oscillator with no positive feedback loop. a, Network diagram t
of What does feedback
the negative this circuit do? This oscillator is similar to the dual-
oscillator. d
feedback oscillator except that the hybrid promoter pLlacO-1 (ref. 14) gives t
expression of lacI and yemGFP in the absence of LacI or in the presence of a
How
IPTG does requiring
without it work?an activator. b, Single-cell density map trajectories s
for cells containing this oscillator (see Supplementary Movie 11 and p
What controls
Supplementary Fig. the
5). amplitude and period of oscillation? o
518
Stricker et al. (2008), Nature 456:516 ©2 0 0 8 Macmillan Publis hers Lim
un-
ach, a + Arabinose b
12
Fluorescence (a.u.)
been
nt11 araC yemGFP 8
ered
per- − IPTG 4
The
work lacI 0
0 60 120 180
back Time (min)
-cell c d
and What does araC do/?
cles.
e of How does this circuit modify the previous one?
ited
tion Why? What control does it provide?
ucer
del-
As
0 IPTG60 or arabinose
120 are
180increased,
0 what
60 will happen
120 to the
180
ng a e oscillations? f
oop,
pro-
The
Is AraC inhibition
cooperative?
, we found
Nor
Mol
con
(×
obtained 0 0
0 80 160 0 30 60
Time (min) Time (min)
n of inducer d 50 e
begins with 60
moters,
Period (min)
Period (min)
rt delay
30
uicker than
s occurs for
umber 10
ator 10
0 10 20 30 0 1 2 3
d, assuming IPTG (mM) Arabinose (%)
of both
abundant Figure 4 | Modelling the genetic oscillator. a, Intermediate processes are
ce as many explicitly modelled in the refined oscillator model. b, c, Simulation results
AraC levels Does
fromthe impact
Gillespie of IPTG
simulations concentration
(b) or also(cmake
deterministic modelling sense?
) at 0.7%
e to the arabinose and 2 mM IPTG. AraC dimers (green), LacI tetramers (red) and
sufficient lacI mRNA (black) are shown. d, e, Comparison of modelling and
ff and the Forexperiment
these diagrams toperiod
for oscillation be true,
at 0.7%which part
arabinose (d) oris2controlling
mM IPTG
(e). Values from deterministic modelling (blue curve), stochastic
yed, the
egins anew.
thesimulations
circuit?(grey symbols, Supplementary Fig. 18), and microscopy (red
me required diamonds) or flow cytometry (green circles) are shown. Lower and upper
on the rate error bars in d represent the 16th and 84th percentiles, respectively, of the
urst. stochastic data, corresponding to 61 s.d. for a normal distribution.
tion
s roughly
design of engineered gene circuits. We found that a full model of the
system that takes into account intermediate steps such as multimeri-
What does this circuit do?
aM-Pp = aM pentapeptide