Building a drug screening model for RNA therapeutics

This is the full reflection. It’s still in progress, but I’ve written some early thoughts in another post. I’m currently prioritizing an explainer and building a usable web tool. Back to directory.

By mid-2025, I had already visited the emergency room 20 times, and the burden of Ehler’s Danlos Syndrome (hyperfragile skin) became too much. Whenever my family had the chance to go out (say) hiking, all it took was one twig and bam. All the plans we made were gone in a snap, and my parents’ hard-work wasted on stitches and late nights in the ER.

So I took to trusty old Google Search to find a treatment. But though it started simply as reading a few articles in my free time, by following one lead after another, I now find myself six months later with one of my biggest projects yet. It’s an ML screening model for discovering RNA-targeted drugs. I’m really proud (and also in disbelief) to say it now performs better than state-of-the-art binding affinity predictors.

Since then, I’ve learned there are far more important issues than EDS, like cancers and pandemic viruses, that we should prioritize this technology for. While I might not get a cure very soon for my own skin, I’m all the more excited to see where it goes. I’m now reflecting on findings so far in a paper and integrating the model into a web app for clinical use (will soon be more than a mockup). And between other experiments, I’ve been stealing moments to set up COL5A1 mRNA screening for EDS.

You can also find the model’s source code on Github.

I gained a lot of useful insights from this project so far. Here, I want to document how the dominos fell and discuss some of the takeaways beyond the paper.

This is partly an attempt to grapple with the whole discovery process. If you asked me just a year ago, I’d have barely any clue how “research” was conducted by professionals, let alone in one’s own room. From pure bouncing around, trying whatever looked most promising (and telling myself I’m just using this to learn ML), I’ve happened to stumble upon something new that works. Is this a fluke? Or might the piecemeal progress look anything like how new things are normally built?

I. Searching for a treatment

Back to that first weekend search. The early results Google yielded were thick sleeves, braces, creams, orthodics or PT, all symptom-level remedies the doctor already suggested. I wanted to find an actual pill or treatment, and dug deeper:

But these were still no cure. We make pills that change our brains and weaponize our own T-cells into living cancer drugs! Is this really the best we can do for EDS??

This was thrilling to discover. Yet, after reading about all the ongoing studies, I was still left with no fix. Do I just keep waiting? It seemed so far away. CRISPR has so far only acted on tiny targets (just the hypothalmus in the obesity study). Collagen genes are enormous and expressed in so many tissues, which would make delivery challenging even after we prove the concept.

II. Models, one after another

[still consolidating old scattered notes to piece together the timeline]

Update 2026-01-18: I’ve reflected on the overall process underlying this project in another essay. I’ll focus more on the technical insights of the model here.