MIT News - Artificial intelligence 10月15日 00:25
Watershed Bio:赋能科学家,简化生物数据分析
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

随着诊断和测序技术的成本不断下降,生物学领域产生了海量数据。然而,科学家在将数据转化为新疗法时,常需软件工程方面的专业支持。Watershed Bio应运而生,其云平台提供工作流模板和可定制界面,使科学家和生物信息学家无论计算技能如何,都能轻松分析复杂的生物数据集,包括全基因组测序、转录组学、蛋白质组学等。该平台旨在弥合科学家与软件工程之间的鸿沟,让他们能更快地从海量数据中解锁洞见,加速疾病治疗的研发进程。

📊 **海量生物数据处理的挑战与Watershed Bio的解决方案**:近年来,诊断和测序技术的成本大幅下降,导致生物学领域产生了前所未有的海量数据。科学家们面临着如何从这些复杂数据中提取有价值信息以开发新疗法的挑战。Watershed Bio通过其云平台,提供了一套用户友好的工具,使用户能够分析包括全基因组测序、转录组学、蛋白质组学等在内的各种复杂数据集,而无需具备深厚的软件工程背景。

🚀 **“无代码”分析平台,加速科学发现**:Watershed Bio的核心在于其“无代码”分析能力,旨在让科学家专注于生物学研究本身,而非耗费时间学习编程。平台提供现成的工作流模板,并将最新的高级分析技术集成其中,使得前沿的计算工具更加易于获取和协作。这极大地缩短了从数据到洞察的周期,让研究人员能够更快地进行实验设计、实施和迭代,从而加速科学发现的进程。

💡 **弥合科学研究与工程实现的差距**:Watershed Bio的创始人Jonathan Wang在金融和学术界都曾遇到过研究人员与工程师之间的沟通瓶颈,意识到在生物学领域也存在类似问题。科学家们渴望利用数据解决疾病问题,但受限于计算能力的不足或软件工程技能的缺乏。Watershed Bio通过提供一个能够让科学家“像在笔记本上构建原型一样轻松地构建生产级模型”的软件层,有效解决了这一效率低下和信息不对称的问题,让生物学研究更加高效和可及。

As costs for diagnostic and sequencing technologies have plummeted in recent years, researchers have collected an unprecedented amount of data around disease and biology. Unfortunately, scientists hoping to go from data to new cures often require help from someone with experience in software engineering.

Now, Watershed Bio is helping scientists and bioinformaticians run experiments and get insights with a platform that lets users analyze complex datasets regardless of their computational skills. The cloud-based platform provides workflow templates and a customizable interface to help users explore and share data of all types, including whole-genome sequencing, transcriptomics, proteomics, metabolomics, high-content imaging, protein folding, and more.

“Scientists want to learn about the software and data science parts of the field, but they don’t want to become software engineers writing code just to understand their data,” co-founder and CEO Jonathan Wang ’13, SM ’15 says. “With Watershed, they don’t have to.”

Watershed is being used by large and small research teams across industry and academia to drive discovery and decision-making. When new advanced analytic techniques are described in scientific journals, they can be added to Watershed’s platform immediately as templates, making cutting-edge tools more accessible and collaborative for researchers of all backgrounds.

“The data in biology is growing exponentially, and the sequencing technologies generating this data are only getting better and cheaper,” Wang says. “Coming from MIT, this issue was right in my wheelhouse: It’s a tough technical problem. It’s also a meaningful problem because these people are working to treat diseases. They know all this data has value, but they struggle to use it. We want to help them unlock more insights faster.”

No code discovery

Wang expected to major in biology at MIT, but he quickly got excited by the possibilities of building solutions that scaled to millions of people with computer science. He ended up earning both his bachelor’s and master’s degrees from the Department of Electrical Engineering and Computer Science (EECS). Wang also interned at a biology lab at MIT, where he was surprised how slow and labor-intensive experiments were.

“I saw the difference between biology and computer science, where you had these dynamic environments [in computer science] that let you get feedback immediately,” Wang says. “Even as a single person writing code, you have so much at your fingertips to play with.”

While working on machine learning and high-performance computing at MIT, Wang also co-founded a high frequency trading firm with some classmates. His team hired researchers with PhD backgrounds in areas like math and physics to develop new trading strategies, but they quickly saw a bottleneck in their process.

“Things were moving slowly because the researchers were used to building prototypes,” Wang says. “These were small approximations of models they could run locally on their machines. To put those approaches into production, they needed engineers to make them work in a high-throughput way on a computing cluster. But the engineers didn’t understand the nature of the research, so there was a lot of back and forth. It meant ideas you thought could have been implemented in a day took weeks.”

To solve the problem, Wang’s team developed a software layer that made building production-ready models as easy as building prototypes on a laptop. Then, a few years after graduating MIT, Wang noticed technologies like DNA sequencing had become cheap and ubiquitous.

“The bottleneck wasn’t sequencing anymore, so people said, ‘Let’s sequence everything,’” Wang recalls. “The limiting factor became computation. People didn’t know what to do with all the data being generated. Biologists were waiting for data scientists and bioinformaticians to help them, but those people didn’t always understand the biology at a deep enough level.”

The situation looked familiar to Wang.

“It was exactly like what we saw in finance, where researchers were trying to work with engineers, but the engineers never fully understood, and you had all this inefficiency with people waiting on the engineers,” Wang says. “Meanwhile, I learned the biologists are hungry to run these experiments, but there is such a big gap they felt they had to become a software engineer or just focus on the science.”

Wang officially founded Watershed in 2019 with physician Mark Kalinich ’13, a former classmate at MIT who is no longer involved in day-to-day operations of the company.

Wang has since heard from biotech and pharmaceutical executives about the growing complexity of biology research. Unlocking new insights increasingly involves analyzing data from entire genomes, population studies, RNA sequencing, mass spectrometry, and more. Developing personalized treatments or selecting patient populations for a clinical study can also require huge datasets, and there are new ways to analyze data being published in scientific journals all the time.

Today, companies can run large-scale analyses on Watershed without having to set up their own servers or cloud computing accounts. Researchers can use ready-made templates that work with all the most common data types to accelerate their work. Popular AI-based tools like AlphaFold and Geneformer are also available, and Watershed’s platform makes sharing workflows and digging deeper into results easy.

“The platform hits a sweet spot of usability and customizability for people of all backgrounds,” Wang says. “No science is ever truly the same. I avoid the word product because that implies you deploy something and then you just run it at scale forever. Research isn’t like that. Research is about coming up with an idea, testing it, and using the outcome to come up with another idea. The faster you can design, implement, and execute experiments, the faster you can move on to the next one.”

Accelerating biology

Wang believes Watershed is helping biologists keep up with the latest advances in biology and accelerating scientific discovery in the process.

“If you can help scientists unlock insights not a little bit faster, but 10 or 20 times faster, it can really make a difference,” Wang says.

Watershed is being used by researchers in academia and in companies of all sizes. Executives at biotech and pharmaceutical companies also use Watershed to make decisions about new experiments and drug candidates.

“We’ve seen success in all those areas, and the common thread is people understanding research but not being an expert in computer science or software engineering,” Wang says. “It’s exciting to see this industry develop. For me, it’s great being from MIT and now to be back in Kendall Square where Watershed is based. This is where so much of the cutting-edge progress is happening. We’re trying to do our part to enable the future of biology.”

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Watershed Bio 生物数据分析 AI 无代码平台 科学发现 生物信息学 Watershed Bio Biological Data Analysis AI No-Code Platform Scientific Discovery Bioinformatics
相关文章