Shane Chu

C2H2 zinc finger as a genome editing tool

Zinc-Finger

C2H2 zinc finger protein.

At this time of writing, using a Zinc finger (C2H2-ZF) to edit DNA sequences sounds ridiculous.

There are much easier tools to edit DNA sequences, e.g., CRISPR Cas 9.

Like its name, C2H2-ZF proteins use their "fingers" to interact with the DNA. We now have pretty standard technologies to make the fingers of C2H2-ZF, i.e., the fingers are now our lego bricks. The bottleneck is that we don't have a clear picture of how the composition of the fingers of a C2H2-ZF affects its DNA binding sites.

Exhaustively checking the different compositions of C2H2-ZF and their binding sites is not an option since protein engineering is expensive.

Recognition code

From a computer science perspective, the above senario is a task of characterizing the following map

f:amino acid composition of a C2H2-ZFbinding sites\bm f : \textsf{amino acid composition of a C2H2-ZF} \rightarrow \textsf{binding sites}

where the map f\bm f is called the recognition code in biology. The recognition code is very much a machine-learning problem. Several established tools exist, such as the Interactive PWM predictor and RCADE, which seek to characterize the recognition code. Both tools characterize the binding sites as position weight matrices (PWM).

Whether to characterize the binding sites in the recognition code problem as a PWM is an open research question. PWM is a compact representation of the binding sites and a local representation. It offers easy interpretations but at the same time, making simplifying assumptions –– e.g., an independent contribution from each position to the binding specificity. Some argue that we need a more expressive representation to reveal more biological insights.

What's advantageous about C2H2 zinc fingers for genome editing?

C2H2 zinc fingers are smaller than other genome editing tools, giving it a more straightforward delivery process into the cell via viral vectors.


Update (4/7/2023):

Recently, an article was published about using ZF as a recognition code. I haven't looked into it in detail; if you are interested, see A universal deep-learning model for zinc finger design enables transcription factor reprogramming.

References

Contact me by E-mail | Github | Linkedin
This work is licensed under CC BY-SA 4.0. Last modified: January 15, 2025.
Website built with Franklin.jl and the Julia language.