BuildSpeak每日 builder 文摘
今日归档生词本关于
🎙 播客No Priors· 2026 年 6 月 10 日· 10,329 词 · 约 52 分钟

Biohub: The Future of Biology is Open-Source with Co-Founders Mark Zuckerberg, Priscilla Chan, and Head of Science Alex Rives

SPACE 播放 / 暂停·←→ 上一句 / 下一句
Speaker 100:00 - 00:02
We just wanna give tools to the whole scientific community.
Speaker 100:00 - 00:02
我们只是想把工具提供给整个科学界。
Speaker 200:02 - 00:17
We wanna understand how biology works. I wanna understand the genetics of this person. I wanna understand the risks they have to different illnesses. My goal is to be able to treat the individual as an individual, understand the mechanisms, and be able to intervene.
Speaker 200:02 - 00:17
我们想理解 biology 是如何运作的。我想理解这个人的 genetics(遗传学)。我想理解他们对不同疾病有哪些风险。我的目标是能够把个体真正当作个体来治疗,理解其中的机制,并且能够进行干预。
Speaker 100:17 - 00:34
We'll have a bigger impact by getting us in more scientists hands quicker by doing it as open source projects instead. It's not just like there's some factory somewhere that you can pay to produce the data. You actually need to invent new novel scientific approaches. The theory isn't that we're gonna cure the diseases. We're not.
Speaker 100:17 - 00:34
通过把这些做成 open source projects(开源项目),更快地交到更多科学家手中,我们会产生更大的影响。这并不只是像某个地方有一家工厂,你付钱就能产出数据。你实际上需要发明全新、原创的科学方法。我们的理论并不是说我们会治愈这些疾病。我们不会。
Speaker 100:34 - 00:36
It's that we wanna help accelerate the pace of progress for the
Speaker 100:34 - 00:36
而是说,我们想帮助加快整个
Speaker 300:36 - 00:42
whole scientific field. We folded over 1,100,000,000 proteins and predicted their structures,
Speaker 300:36 - 00:42
科学领域的进步速度。我们折叠了超过 1,100,000,000 个 protein(蛋白质),并预测了它们的结构,
Speaker 400:42 - 00:50
and we didn't design a model for antibodies. We didn't design a model to be able to bind one particular target. We just designed a model that could understand proteins.
Speaker 400:42 - 00:50
而且我们并没有专门为 antibodies(抗体)设计一个模型。我们也没有设计一个能够结合某个特定 target(靶点)的模型。我们只是设计了一个能够理解 protein(蛋白质)的模型。
Speaker 200:50 - 00:58
If we could design a protein to actually change the physiology, then we can actually cure someone.
Speaker 200:50 - 00:58
如果我们能够设计出一种 protein(蛋白质)来真正改变生理机能,那么我们就真的可以治愈患者。
Speaker 501:05 - 01:18
Today on No Priors, we're joined by Mark Zuckerberg, Priscilla Chan, and Alex Reeves. We'll be talking about Biohub and all their various efforts to now start applying AI at scale to do world models of cells and different levels of interactions across biology.
Speaker 501:05 - 01:18
今天在 No Priors 节目中,和我们一起的是 Mark Zuckerberg、Priscilla Chan 和 Alex Reeves。我们将讨论 Biohub,以及他们目前开展的各种工作:开始大规模应用 AI,来构建关于细胞的 world models(世界模型),以及 biology 各个层级上不同相互作用的模型。
Speaker 401:19 - 01:21
Mark, Priscilla, thank you for doing this.
Speaker 401:19 - 01:21
Mark、Priscilla,感谢你们来做这个采访。
Speaker 101:22 - 01:22
Thanks for having
Speaker 101:22 - 01:22
感谢邀请。
Speaker 401:22 - 01:23
It us
Speaker 401:22 - 01:23
我们
Speaker 301:23 - 01:23
was fun.
Speaker 301:23 - 01:23
很开心。
Speaker 401:24 - 01:26
Congratulations on new missions.
Speaker 401:24 - 01:26
恭喜你们开启新的使命。
Speaker 301:26 - 01:26
Thank you.
Speaker 301:26 - 01:26
谢谢。
Speaker 401:26 - 01:41
You guys made Biohub your primary philanthropic effort and then committed $500,000,000 to this virtual biology initiative. Can you tell us a little bit about, you know, why do that and how did you go from we should fund this to this is like
Speaker 401:26 - 01:41
你们把 Biohub 作为最主要的 philanthropic effort(慈善投入),随后又向这个 virtual biology initiative(虚拟生物学项目)承诺投入 $500,000,000。你们能不能稍微讲讲,为什么要这么做?以及你们是如何从“我们应该资助这件事”走到“这就像是
Speaker 201:41 - 02:04
who we are? So Biohub in its current form, we're super excited about. We feel like it's a really good fit for who we are and what we bring to the table and what we can achieve together. But this work started ten years ago when we were thinking about how can we give back? And wanted to build an organization that could cure, prevent, and manage all disease by the end of the century.
Speaker 201:41 - 02:04
我们是谁”的?所以,就 Biohub 目前的形式而言,我们对此非常兴奋。我们觉得这非常契合我们是谁、我们能带来什么,以及我们一起能够实现什么。但这项工作开始于十年前,当时我们在想:我们该如何回馈社会?并且想要建立一个组织,能够在本世纪结束前治愈、预防并管理所有疾病。
Speaker 202:05 - 02:15
And we had a series of hilarious meetings with scientists that, like, famous Nobel Prize winning scientists were just laughing at us.
Speaker 202:05 - 02:15
而且我们和科学家开过一系列特别好笑的会,那些著名的诺贝尔奖得主科学家简直就是在当面笑我们。
Speaker 402:15 - 02:17
Was that your starting line? We're just gonna cure all disease?
Speaker 402:15 - 02:17
那是你们的起点吗?我们就是要治愈所有疾病?
Speaker 102:17 - 02:34
No. No. And to be clear, we don't think that we're gonna be the ones curing the diseases. Our goal is always to build tools that could accelerate the whole scientific field, so that way the scientific field collectively could cure all the diseases. But still people thought that by the end of the century it was a stretch.
Speaker 102:17 - 02:34
不是,不是。说清楚一点,我们并不认为会由我们来治愈这些疾病。我们的目标一直都是打造能够加速整个科学领域发展的工具,这样一来,科学界整体就有可能治愈所有疾病。只是当时人们仍然觉得,到本世纪末实现这一点都太夸张了。
Speaker 102:34 - 02:35
Now I think it's like too conservative.
Speaker 102:34 - 02:35
现在我觉得,那种看法反而太保守了。
Speaker 202:36 - 02:56
And so we kept being like, okay, well, we had these series of funny, awkward educational conversations where we were like, okay, but like, why? Like, why do you think it's impossible? And like, you know, just being the person in the room is just like, oh, I don't know why. You tell me. Finally, we got people to like, they're like, fine.
Speaker 202:36 - 02:56
所以我们就一直在进行一连串有点好笑、又有点尴尬的科普式对话,我们会说,好吧,但为什么?为什么你觉得这不可能?而在场的人通常就会是那种,哦,我也不知道为什么。你来告诉我。最后,我们终于让人们说出了他们的想法,他们会说,行吧。
Speaker 202:56 - 03:08
If you really must know. And we're like, you know, we do. It seems important. You know, they were like, well, we work in silos. And when you publish, information doesn't get shared.
Speaker 202:56 - 03:08
如果你们非要知道的话。而我们会说,你知道,我们确实想知道。这似乎很重要。然后他们会说,好吧,我们彼此是分 silo(孤岛)工作的。而且论文发表之后,信息也不会被共享。
Speaker 203:08 - 03:32
It gets locked up for long periods of time. And we don't have tooling. You know, they gave the example of like, we build a great tool by one postdoc in a lab and it lives on their computer. And when they graduate, the tool is gone. And we heard was very hard to build shared tools to move science faster, build a shared knowledge base to quickly move science faster.
Speaker 203:08 - 03:32
信息会被锁住很长时间。我们也没有合适的 tooling(工具体系)。他们举的一个例子是,实验室里某个 postdoc(博士后)做出了一个很棒的工具,但它就只存在于那个人的电脑上。等他毕业离开,这个工具也就没了。我们听到的核心问题是:要构建共享工具来加快科学进展非常困难,要建立共享的 knowledge base(知识库)来迅速推动科学发展也非常困难。
Speaker 203:33 - 03:39
And that's sort of where we began in thinking about, okay, like if those are the problems, like what can we contribute?
Speaker 203:33 - 03:39
这大概就是我们开始思考的地方:如果问题是这些,那我们能贡献什么?
Speaker 103:40 - 04:03
Yeah. I mean, so the original Biohub model was basically focus on long term tool development by bringing together engineers and scientists across multiple universities to focus on long term tool development. And basically, it like worked. And, you know, we started off with with CZI doing a number of different things. And I think over time, we just felt like, okay.
Speaker 103:40 - 04:03
对。我的意思是,最初的 Biohub 模式,基本上就是通过把多所大学的工程师和科学家聚到一起,专注于长期 tool development(工具开发)。而事实证明,这个模式基本上奏效了。然后,一开始我们也通过 CZI 做了很多不同的事情。我想随着时间推移,我们只是逐渐觉得,好吧。
Speaker 104:03 - 04:42
The science piece is really working, and we just kept on investing more and more and more in it until now it is basically the primary and main thing that we're doing. And we've expanded the original San Francisco Biohub to a handful now at this point. There's New York. There's Chicago. The real focus and the unifying theme at this point the virtual biology initiative around taking the unique data sets that are able to be generated in order to model effectively starting with the smallest pieces of proteins, but then eventually cells and whole biological systems.
Speaker 104:03 - 04:42
科学这部分确实非常奏效,所以我们就不断、不断、再不断地加大投入,直到现在它基本上已经成了我们正在做的最核心、最主要的事情。到目前为止,我们也已经把最初的 San Francisco Biohub 扩展到了好几个点。现在有 New York,也有 Chicago。当前真正的重点和统一主题,是 virtual biology initiative,围绕如何利用能够生成的独特数据集来进行有效建模,从最小的 protein(蛋白质)片段开始,最终扩展到 cell(细胞)和完整的生物系统。
Speaker 104:42 - 05:24
But that's kind of how we've evolved, this idea that we talk about around that some of this is an AI problem, and you want to build a frontier AI lab, but you need to couple that with a frontier biology effort that can do the work of basically being able to understand and get the data that you need to actually be able to build these models. Because unlike language models, there's just like a lot of data out there on the Internet. That's not really the case with biology. I mean, there are obviously a bunch of different data sets that exist that academia and scientists have generated over the the decades, but a lot of the stuff that I think we wanna put into this, it doesn't exist. Right?
Speaker 104:42 - 05:24
不过这大致就是我们的演进方式:我们一直在谈这样一个想法——其中一部分是 AI 问题,你需要建设一个前沿的 AI lab,但你也必须把它和一个前沿的 biology 工作结合起来,后者要能够完成这样一种工作:真正理解并获取你构建这些模型所需要的数据。因为和 language model(语言模型)不同,互联网上本来就有大量现成数据。但 biology 并不是这样。我的意思是,显然 academia 和科学家在过去几十年里已经生成了很多不同的数据集,但我们想放进这里的很多东西,其实并不存在。对吧?
Speaker 105:24 - 06:10
It's like you wanna be able to visualize things that people haven't been able to see before, which is why we were doing the the imaging work. You wanna be able to record things that are going on inside the body, which is why we're doing the kind of cellular engineering work. Or you want to be able to measure things like inflammation in ways that haven't been possible, which is why the Chicago Biohub is focused on building those kind of devices and being able to do that. And that will fundamentally create new types of datasets that will allow new types of models. And I think it's just a very exciting thing that, going back to what you're saying, if the scientific field, it primarily needs kind of tool development that now is going to empower scientists across the field to be able to do their work faster, that's what we think we can provide through this kind of long term focus on tool development.
Speaker 105:24 - 06:10
这就像是,你希望能够把人们以前看不到的东西可视化出来,这也是为什么我们在做 imaging(成像)方面的工作。你希望能够记录身体内部正在发生的事情,这也是为什么我们在做那类 cellular engineering(细胞工程)工作。或者你想以过去做不到的方式去测量像 inflammation(炎症)这样的东西,这也是为什么 Chicago Biohub 正专注于构建这类设备并实现这些能力。而这从根本上会创造出新类型的数据集,从而支持新类型的模型。我觉得这件事非常令人兴奋。回到你刚才说的,如果科学领域当前最主要需要的是某种 tool development(工具开发),而这些工具将使整个领域的科学家都能更快地开展工作,那么这正是我们认为自己可以通过这种长期聚焦于工具开发来提供的东西。
Speaker 206:10 - 07:08
But I think there's a fun through line on where we started and, you know, bringing us to our work that Alex is driving now, is that our very first request for application, RFA here, was around single cell sequencing. And we wanted to look at sort of like the RNA that is transcribed in individual cells. And that was possible, but it was still pretty early on in understanding how different cells were expressing their DNA to the point where at the beginning we were just funding methods, like getting people to describe how to do it so that others could share that methodology. And then that became us funding the Human Cell Atlas, which is now one of the largest databases of single cell transcriptomes, it was getting hard for scientists to annotate the data. So we built CellByGene, which was like a very simple annotation tool that scientists could use to make use of that data.
Speaker 206:10 - 07:08
但我觉得,从我们最初起步的地方,到如今 Alex 正在推动的工作之间,其实有一条很有意思的贯穿主线。因为我们在这里发出的第一份 RFA,也就是 request for application,是围绕 single cell sequencing(单细胞测序)的。我们想研究的是单个细胞中被转录出来的 RNA。那在当时是可行的,但对于不同细胞如何表达其 DNA 的理解仍然非常早期,以至于一开始我们资助的其实只是方法本身——比如让人们描述应该怎么做,这样其他人就可以共享那套方法。后来,这演变成我们资助 Human Cell Atlas;它现在已经是最大的 single cell transcriptomes(单细胞转录组)数据库之一。再后来,科学家开始很难对这些数据进行 annotation(注释),于是我们构建了 CellByGene,这是一种非常简单的 annotation 工具,科学家可以用它来利用这些数据。
Speaker 207:08 - 07:47
Then a community came around Cell by Gene, built around Cell by Gene, and started contributing more and more data that we had nothing to do with sort of creating or funding or making happen in the world. And now Cell by Gene is a corpus of knowledge that a lot of the transcriptomic based models are based off of and is used regularly by the scientific community. But still there are always critiques. Like, this is just stamp collecting. Like, you're just gathering bits of knowledge sorry, bits of data, and we're not going to be able to pull scientific knowledge and wisdom and insights out of.
Speaker 207:08 - 07:47
随后,围绕 Cell by Gene 形成了一个社区,这个社区开始不断贡献越来越多的数据,而这些数据的创建、资助或在现实中的产生,其实都和我们没有直接关系。现在,Cell by Gene 已经成了一套知识 corpus(语料/知识库),许多基于 transcriptomic(转录组)数据的模型都建立在它之上,科学界也经常使用它。不过批评始终存在。比如有人会说,这不过是在集邮而已。你们只是在收集零碎的知识——抱歉,是零碎的数据——而我们并不能从中提炼出科学知识、智慧和洞见。
Speaker 207:47 - 08:25
And we're like, well, we didn't have an answer for a while. And then imagine our delight when large language models became a huge topic of conversation that could make sense of large amounts of data. And I just for me, it was like, what if we could actually understand how biology worked? Move it from a discovery based science to an engineering based science where we could systematically understand how living beings, living cells worked and be able to understand why things go wrong. And so when we saw that moment, we're like, this is it.
Speaker 207:47 - 08:25
而我们的想法是,好吧,有一段时间我们确实没有答案。后来,当 large language models(大语言模型)成为一个能够理解海量数据的热门话题时,你可以想象我们有多兴奋。对我来说,那感觉就像:如果我们真的能够理解 biology 是如何运作的呢?把它从一门以 discovery(发现)为基础的科学,推进成一门以 engineering(工程)为基础的科学;让我们能够系统性地理解生命体、活细胞是如何工作的,并且能够理解事情为什么会出错。所以当我们看到那个时刻出现时,我们就觉得,就是它了。
Speaker 208:25 - 08:27
Something really big could happen here.
Speaker 208:25 - 08:27
这里真的可能会发生一件非常重大的事情。
Speaker 408:27 - 08:43
Alex, you started at MetaFair, but you were on the path to, you know, you'd assemble the team at evolutionary scale and you'd raise venture and you were making progress in your models. What was the pitch from Mark and Priscilla where you said like, that's actually the right way to go after the mission?
Speaker 408:27 - 08:43
Alex,你是从 MetaFair 开始的,但当时你也已经走在另一条路上:你会在 evolutionary scale 组建团队,你会融资,而且你的模型也在取得进展。Mark 和 Priscilla 当时是怎么向你阐述的,才让你觉得,那才是真正能够推进这个 mission(使命)的正确路径?
Speaker 308:43 - 09:30
Well, I think for me it was really kind of the moment when I understood that, you know, they they really saw this as as an integration of frontier AI and frontier biology. And I think I had developed conviction that, you know, this is really a new era of science that's just beginning, kind of what's gonna be possible with artificial intelligence. And, you know, we're in the age of information theory at scale, and we have these systems that can basically kind of predict the next token, and they can, learn world models from that. They can learn biology from the data. And so, you know, I think that it was really clear that, you know, to build kind of that next kind of institution for the next era, you would really need to have frontier artificial intelligence.
Speaker 308:43 - 09:30
嗯,我觉得对我来说,真正关键的时刻是我意识到,他们确实把这件事看作是 frontier AI 和 frontier biology 的结合。我也逐渐形成了一种确信:这真的是一个刚刚开始的科学新时代,也就是 artificial intelligence 将带来什么样的可能性。我们正处在大规模 information theory(信息论)的时代,而且我们拥有这样一些系统,它们基本上可以预测下一个 token(词元),并由此学习 world models(世界模型)。它们可以从数据中学习 biology。因此我觉得很清楚,如果要为下一个时代建立那种下一代机构,你就真的需要 frontier artificial intelligence。
Speaker 309:30 - 09:45
You would have to have frontier biology. You would need to start to put those things in feedback and really have models that are learning from the biology. And I think, you know, And you need the right scale on the right people. And so this just really felt, I think, like the way to do that.
Speaker 309:30 - 09:45
你还必须拥有 frontier biology。你需要开始把这两者放进反馈回路里,真正让模型从 biology 中学习。而且我认为,你还需要合适规模以及合适的人才。所以这件事整体上就让人觉得,这确实是实现这一目标的方式。
Speaker 509:45 - 10:09
There's a variety of different models that you all have been working on. And I think it's kind of interesting because some of the earliest breakthroughs in biology were things like AlphaFold, where there was a Google model that showed that you could do protein folding at scale in a really interesting way that people didn't realize was very tractable. And this was pre sort of the really big transformer waves that came later. And then you're working on a variety of different things at different scale, right? You're doing incremental molecular modeling and protein folding.
Speaker 509:45 - 10:09
你们一直在研究各种不同的模型。我觉得这挺有意思的,因为 biology 领域一些最早的突破之一就是 AlphaFold,也就是一个 Google 的模型,它表明你可以用一种非常有意思的方式大规模地做 protein folding,而且这是此前很多人没有意识到其实非常 tractable(可处理)的。这还是在后来那波真正的大型 transformer 浪潮之前。然后你们现在也在做很多不同尺度上的事情,对吧?你们在做渐进式的 molecular modeling 和 protein folding。
Speaker 510:09 - 10:25
You're doing cell based stuff. You're thinking about interrogating larger scale systems in biology. How well do you think that extends from sort of the micro to the macro? You mentioned almost starting with building blocks and building up, but modeling cellular behavior is very different from modeling protein folding. The data is very different, the modeling is different.
Speaker 510:09 - 10:25
你们也在做基于 cell 的工作。你们还在考虑如何探查 biology 中更大尺度的系统。你觉得这种能力从微观延伸到宏观的效果有多好?你提到几乎是从 building blocks 开始一路往上构建,但对 cellular behavior 建模和对 protein folding 建模非常不同。数据非常不同,建模方式也不同。
Speaker 510:25 - 10:35
I'm just curious, do you think it's all similar in terms of it's just data and you train stuff? Or do you think there's some differences in terms of how you actually have to deal with these systems?
Speaker 510:25 - 10:35
我只是很好奇,你觉得这些在本质上都差不多吗,也就是“反正都是数据,然后训练东西”?还是说,你认为在实际处理这些系统时,还是存在一些差异?
Speaker 110:35 - 10:50
I mean, are probably some differences. I mean, you can probably talk more to the specifics around this. But I mean, I think each layer is gonna end up being somewhat qualitatively different. Right? But you need to be able to understand the protein interactions in order to be able to understand how cells work.
Speaker 110:35 - 10:50
我的意思是,可能确实会有一些差异。我想,你大概可以更具体地谈谈这方面。不过我觉得,每一层最终都会在性质上有些不同,对吧?但你必须能够理解 protein 之间的相互作用,才能理解 cell 是如何工作的。
Speaker 110:50 - 11:20
So you can't just go straight to cells in a way without understanding the protein modeling. And then if you're trying to understand something like the, you know, the way the immune system works or a bunch of cells interact together, then, you know, it's tough to do that without first understanding cells. I mean, you might be able to, at, like, a very high level of abstraction, simulate a system. But if you really wanna, like, understand how it's gonna work, you kind of wanna build the simulations at each level hierarchically. So that's basically the approach that we're going through starting with the building blocks and the and the protein.
Speaker 110:50 - 11:20
所以,如果不先理解 protein modeling,你就不能直接跳到 cells 这一层去。然后如果你想理解比如 immune system 是如何工作的,或者一群 cells 是如何彼此交互的,那么如果不先理解 cells,本身就很难做到这一点。我的意思是,你也许可以在一个非常高层次的抽象层面上去模拟一个系统。但如果你真的想理解它将如何运作,你其实会希望在每一个层级上以分层的方式来构建模拟。所以这基本上就是我们正在采取的方法:从 building blocks 和 protein 开始。
Speaker 111:20 - 11:34
But yeah. I mean, I think that there's gonna be different types of data that you wanna collect for each. The modeling techniques, think we'll see. I mean, that'll all keep on advancing across the board. But I do think that, like, a big part of the strategy is this view that you need to build it up hierarchically.
Speaker 111:20 - 11:34
不过,是的。我觉得你会希望为每一层收集不同类型的数据。至于 modeling techniques,我想我们还要继续观察。我的意思是,这些方法都会在各个方向上持续进步。但我确实认为,这个策略里很重要的一部分,就是这种观点:你需要以分层的方式把它一步步构建起来。
Speaker 211:35 - 12:13
And, you know, one of the things that's unique about us in this space is we are very intentional that the AI efforts and the wet lab efforts were a single effort. And we've done a lot of work to bring them together. And the really neat thing that we can do is really try to pull and gather data that helps us connect across sort of the hierarchy. You know, you can look at transcriptomics with space within a cell and look at where it's localizing. We can look at translucent zebrafish and look at the development across different cells and when the brain develops.
Speaker 211:35 - 12:13
而且,你知道,我们在这个领域里有一点很独特:我们非常有意识地把 AI 方面的工作和 wet lab 方面的工作当作同一项工作。我们也做了很多工作把两者整合起来。而我们真正能做的一件很棒的事,就是尽量去提取和汇集数据,帮助我们跨越这种层级结构建立联系。你知道,你可以在细胞内部、带着空间信息去看 transcriptomics(转录组学),看它定位在什么地方。我们也可以观察 translucent zebrafish,去看不同细胞之间的发育过程,以及大脑是在什么时候发育出来的。
Speaker 212:13 - 12:35
We have sensors that allow us to look at cell cell communication in different molecules. And so we can be strategic about the types of experiments and data we want to collect that helps us bridge across these, that makes it so that there's some connective tissue that helps drive the modeling that you know, the modeling magic that happens.
Speaker 212:13 - 12:35
我们有一些 sensors,能够让我们观察细胞与细胞之间在不同分子层面的 communication(通讯)。所以我们可以有策略地设计想做的实验类型,以及想收集的数据类型,帮助我们在这些层面之间搭桥,让其中存在一些 connective tissue(连接组织/纽带),从而推动 modeling,也就是你知道的,那种会发生的 modeling magic。
Speaker 512:35 - 12:40
Yeah. The reason I asked the question, by the way, is I used to be a biologist. I have a PhD in biology, and I worked in wet labs for almost a decade and everything else.
Speaker 512:35 - 12:40
对了,我之所以问这个问题,是因为我以前是个生物学家。我有 biology 的 PhD,而且我在 wet lab 里工作了将近十年,除此之外也做过很多别的事情。
Speaker 212:40 - 12:42
Are looking for a job.
Speaker 212:40 - 12:42
你是在找工作吗。
Speaker 512:46 - 12:48
We can talk about that later.
Speaker 512:46 - 12:48
我们可以稍后再聊这个。
Speaker 112:48 - 12:49
It's not a no. At
Speaker 112:48 - 12:49
也不是不行。到
Speaker 512:49 - 12:49
this point
Speaker 512:49 - 12:49
目前这个
Speaker 112:49 - 12:51
in my career, know?
Speaker 112:49 - 12:51
职业阶段,你知道吗?
Speaker 212:51 - 12:52
Hearing, I
Speaker 212:51 - 12:52
听到这点,我
Speaker 112:52 - 12:53
love my aggressive reaction.
Speaker 112:52 - 12:53
很喜欢我这种有点激烈的反应。
Speaker 512:53 - 13:15
I'm like Danny Glover, you know, and Lisa Waffen. I'm almost at retirement. But I think one of the things that was always lacking was this integrative nature across the different layers of biology, and the developmental biologists would work on their own, the molecular biologists would be doing different experiments. And so that's what I was curious about. Typically, there's a reductionist view of biology and there's a systems view, and those people didn't really work together deeply.
Speaker 512:53 - 13:15
我有点像 Danny Glover,你知道,还有 Lisa Waffen。我都快退休了。但我觉得一直以来有所欠缺的一点,是在 biology 不同层级之间这种整合性的特质;developmental biologists 往往各做各的,molecular biologists 也在做不同的实验。所以这正是我感到好奇的地方。通常,人们对 biology 要么采取 reductionist view(还原论视角),要么采取 systems view(系统视角),而这两类人其实并没有真正深入地合作。
Speaker 513:15 - 13:20
And so one of the exciting things about what you're doing actually is how you're bridging that. And so that was kind of the basis for the question as well.
Speaker 513:15 - 13:20
所以,其实你们正在做的一件令人兴奋的事,就是你们正在弥合这一点。这也是我提出那个问题的一部分原因。
Speaker 313:20 - 14:14
Yeah. And if I could add something there, you know, it's I think that, you know, we're in the age of this kind of information theory in biology. And so, you know, there there are levels of of complexity and hierarchy in biology, and kind of each level is is made up of and, you know, constituted by the lower levels. And so as you want to have that kind of more complete description and you want to have systems that can really generalize and begin to actually answer, you know, experimental questions digitally that you could ask in the lab, you need to have kind of the right basis for modeling at every level. And so I think what's really unique about what we can do is to, as Priscilla and Mark were saying, really build information at each of these different layers, collect them, collect kind of those connection points, but then also really kind of do it at the scale that will reveal that underlying information architecture.
Speaker 313:20 - 14:14
对。如果我可以补充一点,我认为我们正处在 biology 中这种 information theory(信息理论)时代。biology 里存在不同层次的复杂性与层级结构,而每一层某种意义上都是由更低层构成、并由其定义的。因此,如果你想获得那种更完整的描述,想要建立真正能够泛化、并开始以数字化方式回答实验问题的系统——也就是回答那些你本来会在 lab 里提出的问题——那你就需要在每一个层级上都有正确的建模基础。所以我觉得,我们真正独特的一点在于,正如 Priscilla 和 Mark 所说,我们能够在这些不同层面上真正构建信息,把它们收集起来,收集这些连接点,同时还能以足够揭示底层信息架构的规模来完成这件事。
Speaker 314:14 - 14:21
And that's gonna be really critical to actually be able to build digital representations that can answer new experimental questions.
Speaker 314:14 - 14:21
而这对于真正构建能够回答全新实验问题的数字化表征,将会是非常关键的。
Speaker 414:22 - 15:01
One of the things that inspires me most about this effort is really what Priscilla said, which is like, well, there's so much we actually understand about biology and what if we could? Which I think is actually very different from lots of other incredibly interesting and useful AI problems we attack where we're like trying to replicate human behavior. I'm like, a lot of that data's, you know, on the internet or captured. And without pretending to understand all human behavior, can predict a lot of it. I thought one of the most interesting things in your release was actually, you know, the mechanistic interpretability stuff you alluded to, which is, can we actually extract new knowledge from, you know, what the model believes is happening?
Speaker 414:22 - 15:01
这项工作最激励我的一点,其实正是 Priscilla 说的那件事:我们其实已经理解了很多 biology,那么如果我们能进一步利用这一点,会怎样?我觉得这和很多其他同样非常有趣、也很有用的 AI 问题非常不同;在那些问题上,我们往往是在试图复制 human behavior(人类行为)。我的看法是,这类数据很多都已经在 internet 上,或者已经被采集下来了。即便不假装自己理解了全部 human behavior,我们也能预测其中相当大的一部分。我觉得你们这次发布中最有意思的一点之一,其实是你提到的 mechanistic interpretability(机制可解释性)相关内容:我们是否真的能够从模型所“认为”正在发生的事情中,提取出新的知识?
Speaker 415:01 - 15:03
Right? Can you talk a little bit about that?
Speaker 415:01 - 15:03
对吧?你能稍微谈谈这个吗?
Speaker 315:03 - 15:34
Yeah, I'm really excited about that. So I think, you know, in mechanistic interpretability, kind of traditionally it's been applied to large language models with the goal of understanding, you know, kind of what is the representation space of a large language model? How does it compute things, and does that really connect to, you know, what we understand about our intuitive understanding of the world? And so there's, I think, this really rich toolkit that has been developed to start to be able to ask those questions. So kind of what does that mean for biology?
Speaker 315:03 - 15:34
是的,我对此真的很兴奋。所以我想,你知道,在 mechanistic interpretability(机制可解释性)这个领域里,传统上它主要被应用于 large language model(大语言模型),目标是去理解,比如说,大语言模型的 representation space(表征空间)究竟是什么样的?它是如何进行计算的?而这些又是否真的能与我们对世界的直觉性理解建立联系?所以我认为,为了开始提出这类问题,人们已经发展出了一套非常丰富的工具箱。那么这对 biology(生物学)意味着什么呢?
Speaker 315:34 - 16:05
One of the classes of models that we train are these protein language models. So they're really, you know, trained on the codes of proteins. And so anything they learn about biology is is kind of emergent. And we've seen that they can learn things like biological structure and biological function, and that's just kind of emergent from this, you know, token prediction training task. So, you know, as we think about, like, mechanistic interpretability in those models, you know, we're really seeing the unknown because the models have been trained on billions of protein sequences.
Speaker 315:34 - 16:05
我们训练的一类模型是 protein language model(蛋白质语言模型)。它们本质上确实是训练在蛋白质编码之上的。因此,它们学到的任何关于 biology(生物学)的东西,某种意义上都是 emergent(涌现)的。我们已经看到,它们能够学会 biological structure(生物结构)和 biological function(生物功能)之类的东西,而这些都只是从这种 token prediction(token 预测)训练任务中涌现出来的。所以,当我们思考这些模型中的 mechanistic interpretability(机制可解释性)时,我们其实是在看到未知,因为这些模型是在数十亿条蛋白质序列上训练出来的。
Speaker 316:05 - 16:36
They've been trained on, you know, both known and unknown biology. And yet they're developing these representations that start to kind of capture things that we can really see correspond to that reductive picture of biology that's been built up over the centuries. So kind of you can start to connect the dots between proteins where we kind of really don't know anything about them with proteins where we do know something because there's that kind of underlying structure grammar that's linking them in the representation space of the model.
Speaker 316:05 - 16:36
它们是在已知和未知的 biology(生物学)上一起训练的。然而,它们却发展出了这样的 representations(表征):开始能够捕捉到一些东西,而这些东西确实与几个世纪以来逐步建立起来的那种还原论式 biology(生物学)图景相对应。所以,你可以开始在那些我们几乎一无所知的蛋白质与那些我们已经了解一些的蛋白质之间建立联系,因为在模型的 representation space(表征空间)里,存在着一种将它们连接起来的底层结构语法。
Speaker 416:37 - 16:47
And at the extreme, it could be, you know, we're going to understand systems in the body that we didn't before or the mechanism of action for a new treatment because we can ask the model, right, interrogate that representation.
Speaker 416:37 - 16:47
再极端一点说,这甚至可能意味着,我们将能够理解以前并不了解的人体系统,或者理解一种新疗法的 mechanism of action(作用机制),因为我们可以向模型发问,对吧,去 interrogate(审视、拷问)那种表征。
Speaker 316:47 - 16:58
That's right. The hope is that you kind of really learn the underlying basis for how it's making the predictions. And so you open up the black box and you can actually understand kind of the biology that the model is representing.
Speaker 316:47 - 16:58
没错。我们的希望是,你真正能够学到它做出这些预测的底层依据。这样一来,你就打开了 black box(黑箱),并且能够实际理解模型正在表征的 biology(生物学)。
Speaker 416:58 - 17:23
So asking for a friend, you know, you guys all believe in venture backed companies as a way to have impact on the world. Was it like collecting data on zebrafish or the span of the data, or the wet lab work, or just the scale? Like what makes this a better fit for this big nonprofit, you know, ecosystem effort versus a venture backed company?
Speaker 416:58 - 17:23
这么问是替一个朋友问的,你知道,你们大家都相信 venture backed company(风险投资支持的公司)是对世界产生影响的一种方式。那这里的问题是像斑马鱼数据的采集、数据跨度、wet lab(湿实验)工作,还是单纯就是规模?到底是什么让这件事更适合放在这种大型 nonprofit(非营利)生态系统式的努力中去做,而不是交给一家 venture backed company(风险投资支持的公司)?
Speaker 117:24 - 17:27
Well, I think we just want to give tools to the whole scientific community.
Speaker 117:24 - 17:27
嗯,我想我们只是想把工具提供给整个科学界。
Speaker 417:27 - 17:29
And I mean, like, so I I think
Speaker 417:27 - 17:29
我的意思是,所以我觉得
Speaker 117:29 - 17:54
in order to have the biggest impact, I mean, part of it is just we're I mean, it's not actually clear that we couldn't run it as a business if we wanted to. I just think that we'll have a bigger impact by getting this in more scientists' hands quicker by doing it as open source projects instead. So, yeah, I mean, I think that that's kind of the approach. But I don't know. It's an interesting question.
Speaker 117:29 - 17:54
为了产生最大的影响,我的意思是,一部分原因只是我们……我的意思是,其实并不明确说明如果我们想的话,就不能把它作为一门生意来运作。我只是认为,如果改做 open source projects(开源项目),更快地把这些东西交到更多科学家手中,我们会产生更大的影响。所以,是的,我觉得大概就是这种思路。但我也不确定。这是个很有意思的问题。
Speaker 117:54 - 18:28
I'm not sure that I mean, obviously, you were doing it as a for profit company, a bunch of the modeling before, then you run into certain issues. You have to raise a large amount of money in order to build the compute clusters. I think in a lot of ways, the data is actually even more of a constraint. Because if you look at the scale of these models compared to language models, they're smaller, but they're smaller because the amount of data is less. In order to get the data, it's not just like there's some factory somewhere that you can pay to produce the data.
Speaker 117:54 - 18:28
我不太确定,我的意思是,很显然,你们之前把它作为一家 for profit company(营利性公司)在做,也做了很多建模,但随后会遇到一些问题。你必须筹集大量资金来建设 compute clusters(计算集群)。我认为在很多方面,data(数据)实际上是更大的约束。因为如果你看这些模型相对于 language models(语言模型)的规模,它们更小,但之所以更小,是因为数据量更少。为了拿到这些数据,情况并不是像某个地方有一家工厂,你付钱就能让它生产数据那样简单。
Speaker 118:28 - 19:02
You actually need to invent new, novel, scientific approaches to be able to do the, know, for example, the type of cellular engineering we're doing in New York or the types of devices in Chicago, which is why, you know, when we're talking about this concept of frontier biology and frontier AI, the frontier biology is you need to do real science to advance different biological methods in order to be able to observe the things that create the data that go into the model. Mhmm. So it's not just like an off the shelf thing that you can create. Now that's a pretty big effort. I don't know that there are, like, that many things like that that are done as as biotechs.
Speaker 118:28 - 19:02
你实际上需要发明新的、原创的 scientific approaches(科学方法),才能去做——比如说——我们在 New York 做的那类 cellular engineering(细胞工程),或者 Chicago 的那类 devices(设备)。这也是为什么,当我们谈到 frontier biology(前沿生物学)和 frontier AI(前沿 AI)这个概念时,所谓 frontier biology,指的是你必须真正去做科学研究,推进不同的生物学方法,才能观察到那些会形成数据、并输入模型的事物。嗯。所以这不是那种现成 off the shelf(现成可买)的东西,可以随手做出来。这本身就是一项相当大的工作。我不确定像这样的事情,有多少是以 biotechs(生物科技公司)的形式在做的。
Speaker 119:02 - 19:38
I think it's just the scale of the ambition of what we're doing, the time horizon over which we're committed to doing it. I think part of the theory is, like, if you're building tools that are this complicated, you kinda wanna have a ten to fifteen year time horizon on on building out these efforts. And then the scale of capital required I mean, I guess there's no rule that said that you couldn't do it as an incredibly well funded startup, but I think that this just made more sense. And then it also is is simplifying strategically to not have to think about you're gonna make money with the different things. I mean, we just we wanna get the models in people's hands.
Speaker 119:02 - 19:38
我觉得关键就在于我们所做事情的 ambition(目标雄心)的规模,以及我们承诺投入其中的时间跨度。我认为其中一部分理论是:如果你在构建如此复杂的工具,你大概会希望以十到十五年的时间 horizon(周期)来推进这些工作。再就是所需 capital(资本)的规模——我的意思是,我想也没有哪条规则说你不能把这件事做成一家资金极其充足的 startup(初创公司),但我觉得现在这种方式更合理。而且,从战略上说,不必去考虑如何通过这些不同的东西赚钱,也会让事情更简单。我的意思是,我们只是想把这些模型交到人们手中。
Speaker 119:38 - 19:50
We release them as open source. I think that that's like a very valuable thing to do. And again, I mean, the the theory isn't that we're gonna cure the diseases. We're not. It's that we wanna help accelerate the pace of progress for the whole scientific field.
Speaker 119:38 - 19:50
我们会把它们作为 open source(开源)发布。我认为这是一件非常有价值的事。再说一次,我们的理论并不是说我们自己会 cure the diseases(治愈这些疾病)。我们不会。我们的目标是帮助加快整个 scientific field(科学领域)的进步速度。
Speaker 219:50 - 20:36
As the person least experienced with making money here, I would say that there the sort of neutral nonprofit nature of our work actually helps harness more people to enter this effort. And to actually achieve the mission of like understanding the totality of human biology and to cure, prevent, manage all disease, you actually do need the entire academic biotech industry to come together and to work on this in a sort of unified way, in part because there's a lot of talent out there. And it's not helpful to leave any talent, exclude any talent from the effort. And there's a super long tail of diseases. There are the common ones.
Speaker 219:50 - 20:36
作为这里最没有赚钱经验的人,我会说,我们这种中立的 nonprofit(非营利)工作性质,实际上有助于吸引更多人加入这项努力。而要真正实现那种“理解 human biology(人类生物学)的整体,并治愈、预防、管理所有疾病”的使命,你实际上确实需要整个 academic(学术界)和 biotech industry(生物科技产业)联合起来,以某种统一的方式共同推进这件事,部分原因是外面有大量人才。而把任何人才排除在这项努力之外,都是没有帮助的。并且,疾病存在一个非常长的 long tail(长尾)。当然也有那些常见疾病。
Speaker 220:36 - 21:16
And even the common ones, I think if you unbundle heart disease, cancer, neurodegenerative diseases, even if you unbundle like dementia or depression, there are many, many, many subcategories that become more and more niche. And that's not even looking at the long, long tail of rare diseases. Those often get orphaned and don't get brought along when we're sort of looking at what the most efficient way to impact the lives of many. But if you sort of decentralize the effort and put the tools in many people's hands, you start getting people who are like, you know what? I am super interested in spinal muscular atrophy, and that's something I care deeply about.
Speaker 220:36 - 21:16
而即使是那些常见疾病,我认为如果你把 heart disease(心脏病)、cancer(癌症)、neurodegenerative diseases(神经退行性疾病)拆开来看,甚至即使你把 dementia(痴呆)或 depression(抑郁症)进一步拆开,都会出现非常非常多、越来越细分的小类别。那还根本没有涉及 rare diseases(罕见病)那条很长很长的长尾。那些疾病往往会被忽视;当我们在考虑怎样最有效地影响尽可能多人的生活时,它们通常不会被一并带上。但如果你把这项努力去中心化,把工具交到很多人手中,你就会开始看到有人会说:你知道吗?我对 spinal muscular atrophy(脊髓性肌萎缩症)特别感兴趣,而且这是我非常在乎的事情。
Speaker 221:16 - 21:41
And if you put the tools in that person's hands, they're gonna be able to make progress. In a way, if you had to focus your efforts and make big bets, you probably wouldn't because it's just a niche individual small group disease that actually will in turn, if we can understand that disease process, helps us unlock knowledge about a lot more about how the human body works.
Speaker 221:16 - 21:41
如果你把工具交到这个人手里,他们就能够取得进展。换一种方式说,如果你必须集中资源、下大的赌注,你大概率不会去做这件事,因为它只是一个很小众的、涉及少数人的疾病。但实际上,如果我们能够理解这种疾病的过程,反过来就能帮助我们解锁更多关于人体如何运作的知识。
Speaker 321:41 - 21:42
Mhmm. Do you
Speaker 321:41 - 21:42
嗯。你
Speaker 521:42 - 21:54
have any thoughts or predictions in terms of what disease areas this work will impact first? I know it's very hard to be predictive about these things. But just given the nature of the work and the nature of the models, are there areas you're most optimistic about in the short to medium term?
Speaker 521:42 - 21:54
是否对这项工作最先会影响哪些疾病领域,有什么想法或预测?我知道这类事情很难准确预测。但仅从这项工作的性质和这些模型的特点来看,在短期到中期内,你对哪些领域最乐观?
Speaker 221:55 - 22:13
That's actually not how I think about it at least. The way I think about it is like we want to understand how biology works. The ideal world is you would say, I understand the genetics of this person. So I want to think about people at the individual level. I want to understand the genetics of this person.
Speaker 221:55 - 22:13
至少对我来说,我其实不是这样思考的。我思考这个问题的方式更像是:我们想要理解 biology(生物学)是如何运作的。理想情况下,你会说,我理解这个人的 genetics(遗传学)。所以我想在个体层面去思考人。我想理解这个人的 genetics(遗传学)。
Speaker 222:13 - 22:51
I want to understand the risks they have to different illnesses. I want to understand the mechanistic connection between, say, gene variant, a protein, and a disease process. Because if you understand that through chain, then you can design a protein, design a drug, bespoke to them, and actually make an intervention. And right now, I'm sure we've all had experiences being sick. And if you have something that's even remotely nonstandard, you go into PubMed, you look up a paper, you look up the supplement, and then you start going through the methods.
Speaker 222:13 - 22:51
我想理解他们对不同疾病有哪些风险。我想理解比如 gene variant(基因变异)、protein(蛋白质)和 disease process(疾病过程)之间的机制性联系。因为如果你理解了这条链路,你就可以为他们设计一个 protein(蛋白质),设计一种 drug(药物),为其量身定制,并真正实施干预。而现在,我相信我们都经历过生病。如果你得的是哪怕稍微有点不标准的情况,你就会去 PubMed,查一篇 paper(论文),再去看 supplement(补充材料),然后开始翻 methods(方法部分)。
Speaker 222:51 - 23:11
And you're like, am I represented in this paper? And we're just making guesses. We really have no mechanistic understanding. We're saying like, okay, you're kind of like these people that we studied, and this drug kind of impacts the pathway that we think is implicated. Let's try and see if anything happens.
Speaker 222:51 - 23:11
然后你会想,我在这篇论文里被代表了吗?而我们其实只是在猜。我们真的没有机制层面的理解。我们是在说,好吧,你有点像我们研究过的这些人,而这个 drug(药物)大概会影响我们认为相关的那条 pathway(通路)。那就试试看会不会发生点什么。
Speaker 223:12 - 23:39
And time passes. And sometimes it works and sometimes it doesn't. So my goal is to be able to treat the individual as an individual, understand the mechanisms, and be able to intervene. And there are different diseases that are at different stages of filling out that whole through line. And so for some diseases, you just want to understand which gene variants actually cause disease and which don't.
Speaker 223:12 - 23:39
然后时间流逝。有时候有效,有时候无效。所以我的目标是,能够把个体当作真正的个体来治疗,理解其中的机制,并且能够进行干预。而不同疾病在补全这整条脉络这件事上,所处的阶段各不相同。所以对某些疾病来说,你首先只是想弄清楚,究竟哪些 gene variants(基因变异)真的会致病,哪些不会。
Speaker 223:39 - 24:04
And that in itself can be super empowering to patients. And if beyond that, there are some diseases where we understand the chain. We just can't intervene and change a specific protein function. That's super exciting too. Like, if we could design a protein to actually change the physiology, then we can actually cure someone.
Speaker 223:39 - 24:04
而仅这一点本身,就能给患者带来极大的赋能。再进一步,有些疾病中,我们已经理解了那条链路。我们只是还无法干预并改变某个特定 protein(蛋白质)的功能。这也非常令人兴奋。比如,如果我们能设计一种 protein(蛋白质)来真正改变 physiology(生理状态),那我们就真的有可能治愈一个人。
Speaker 224:04 - 24:12
But to me, like, that is just as exciting as understanding contributing to our understanding of like how someone gets sick in
Speaker 224:04 - 24:12
但对我来说,这和增进我们对一个人生病机制的理解一样令人兴奋。
Speaker 324:12 - 24:13
the first time.
Speaker 324:12 - 24:13
第一次。
Speaker 524:13 - 24:33
And that's a very exciting vision because you're basically saying you can bring generalizable tools to provide very personalized things for each individual person. Yes. And that's the power of the approach, is you have these big models that you build that can then apply anywhere. I know that you mentioned earlier that you were gonna try and cure prevent all diseases within one hundred years. And you mentioned, hey, could actually be sooner now given all the advances in AI.
Speaker 524:13 - 24:33
而且这是一个非常令人兴奋的愿景,因为你基本上是在说,你可以带来可泛化的工具,为每一个个体提供高度个性化的东西。是的。这种方法的力量就在于,你构建出这些大型模型,然后它们就可以应用到任何地方。我知道你之前提到过,你们会尝试在一百年内治愈并预防所有疾病。你还提到,鉴于 AI 方面的所有进展,实际上现在可能还会更早。
Speaker 524:33 - 24:37
Do you have some thought of when we think we'll be closer to that goal or something?
Speaker 524:33 - 24:37
对于我们什么时候会更接近那个目标之类的问题,你有什么想法吗?
Speaker 124:37 - 24:52
Mean, I'm optimistic it'll be sooner. I mean, I think the the thing that's complicated is that it's a dynamic system. Right? So if you fix something, there will obviously be future things that you need to work on. So I don't think that the current set of things that we're aware of are gonna be the only things that need to get worked out.
Speaker 124:37 - 24:52
我的意思是,我乐观地认为会更早。我是说,我认为这里复杂的一点在于,它是一个动态系统。对吧?所以如果你解决了某个问题,显然未来还会有你需要继续处理的事情。所以我不认为,我们当前所知道的这一组问题,会是唯一需要被解决的东西。
Speaker 124:52 - 25:19
But I don't know. I think that the progress with AI is really is is obviously very exciting on this. The other thing that I'd say, just adding to what you were saying a second ago, is we really look at more systems than than specific diseases. So for example, one area that seems really important to understand is inflammation. We talked about this a bunch.
Speaker 124:52 - 25:19
但我也不知道。我认为 AI 带来的进展,在这件事上确实、显然非常令人振奋。还有一点,我想补充你刚才说的话:我们真正关注的更多是 system(系统),而不是某一种具体疾病。比如说,一个看起来非常重要、值得理解的领域是 inflammation(炎症)。我们已经谈过很多次这个问题了。
Speaker 125:19 - 26:01
This is a big focus of the Chicago Biohub. There's a lot of data on that. It seems quite clear that it's connected to a bunch of different diseases, but rather than studying the specific diseases, we think that by trying to understand inflammation more broadly, that will make it so that other companies that can then use these tools can work on specific therapies. Another example is I think that the immune system, I think, is a very good case to study for some of the work that we're doing in cellular engineering and when we kind of ladder up from proteins to cells to, like, whole dynamic systems within the body, I think that that one makes sense. I mean, it's sort of privileged.
Speaker 125:19 - 26:01
这是 Chicago Biohub 的一个重点方向,这方面有很多数据。看起来相当清楚的是,它与很多不同的疾病都有联系;但我们认为,与其研究那些具体疾病,不如尝试更广泛地理解 inflammation(炎症),这样一来,其他公司之后就可以使用这些工具去研发具体的疗法。另一个例子是,我认为 immune system(免疫系统)是一个非常好的研究对象,适合用于我们在 cellular engineering(细胞工程)方面所做的一些工作;而且当我们从 proteins(蛋白质)逐步上升到 cells(细胞),再到身体内部那种整体性的动态系统时,我觉得它是说得通的。我的意思是,它在某种程度上是比较特殊的。
Speaker 126:01 - 26:19
It can and the cells can travel around through the body, all that. You know? So, obviously, that has a big part in addressing different diseases. How do you make the immune system function better? But exactly how do you connect that last mile I think is gonna be more something that biotech or other academics individually studying things will be better suited to do.
Speaker 126:01 - 26:19
它可以——而且这些细胞可以在身体里到处移动,诸如此类。你知道的。所以显然,它在应对不同疾病方面占有很大一部分。你要如何让 immune system(免疫系统)运作得更好?但究竟该如何打通最后一公里,我觉得这更多会是 biotech(生物技术)公司,或者分别研究具体问题的其他学术研究者,更适合去做的事情。
Speaker 126:19 - 26:25
So this is kind of how we think about building out the toolset that just helps accelerate all these other folks.
Speaker 126:19 - 26:25
所以,这大致就是我们看待如何构建这套 toolset(工具集)的方式——它就是帮助加速所有其他这些人的工作。
Speaker 426:26 - 26:51
Whether the timeline is ten years, hopefully less than one hundred now, I think this is useful for maybe your average doctor or patient, human being, everybody's a patient, to think about like what's externally visible in the progress here. You worked with patients for a long time at UCSF. Like, what should doctors look out for? What should people look out for if you're actually accelerating progress?
Speaker 426:26 - 26:51
不管这个时间线是十年,还是现在看来最好少于一百年,我觉得对于普通医生、病人,或者说普通人——其实每个人某种意义上都会成为病人——有必要去想一想:在这里的进展中,哪些是从外部可见的。你在 UCSF 和病人打了很长时间交道。那么,医生应该关注什么?如果你们确实在加速进展,人们又应该留意什么?
Speaker 226:52 - 27:27
This is the part You know, I'm super excited about the progress, especially with this launch that Alex and his team have put forth. And I think it's very clear that science is gonna start moving pretty quickly. And I think the thing that's less clear to me is exactly how we translate to the clinic and what that looks like. And I think what has to change is actually the way we do clinical research. And my hope is that we're really shortening the distance between bench research and patient impact.
Speaker 226:52 - 27:27
这部分是这样——你知道,我对这些进展非常兴奋,尤其是 Alex 和他的团队这次推出的成果。我认为现在已经非常清楚,科学的发展将开始变得相当快。但对我来说,没那么清楚的是,我们究竟要如何把这些东西转化到临床,以及那会是什么样子。我认为真正必须改变的,其实是我们开展 clinical research(临床研究)的方式。我的希望是,我们能真正缩短 bench research(台式/基础研究)与对患者产生实际影响之间的距离。
Speaker 227:28 - 28:04
But there's a lot of steps there that we need people who actually take care of patients to think creatively and think about how to deploy safely. And that's a gap that we have some work in. We partner with Jennifer Downey on our CRISPR Cures program at UCSF. So we're dipping our toe in understanding how the deployment of research needs to change given how quickly research will be progressing. But that one is still, I think, still shaping up.
Speaker 227:28 - 28:04
但在这中间还有很多步骤需要做,我们需要那些真正照护病人的人去创造性地思考,并思考如何安全地部署这些成果。这正是我们还有不少工作要做的一个空白点。我们在 UCSF 的 CRISPR Cures program 中与 Jennifer Downey 合作。所以我们现在算是在初步摸索:在研究进展速度会如此之快的前提下,研究的部署方式需要如何改变。不过我觉得,这件事目前仍然还在逐步成形。
Speaker 328:04 - 28:07
Maybe I could say something about our most recent launch
Speaker 328:04 - 28:07
也许我可以讲讲我们最近一次发布的内容。
Speaker 428:07 - 28:07
because I
Speaker 428:07 - 28:07
因为我
Speaker 328:07 - 28:08
think it
Speaker 328:07 - 28:08
觉得它
Speaker 428:08 - 28:10
also Oh, yeah, kind please. We should ask you explicitly about it.
Speaker 428:08 - 28:10
也——哦,对,当然,请讲。我们确实应该专门问问你这个。
Speaker 328:11 - 28:39
Yeah. So, you know, because I guess it was just a week ago, about now, so we announced the new ESM fold. And so this is basically an open system for scientific discovery in protein biology. It's a world model of protein biology that's been trained. It's a language model based, so it's been trained on billions of protein sequences, kind of learns these emergent representations of protein biology.
Speaker 328:11 - 28:39
对。所以,你知道,因为我想那也就是大概一周前,就在差不多这个时候,我们发布了新的 ESM fold。这基本上是一个用于 protein biology(蛋白质生物学)科学发现的开放系统。它是一个已经训练好的 protein biology(蛋白质生物学)world model(世界模型)。它是基于 language model(语言模型)的,所以是在数十亿条蛋白质序列上训练出来的,会学习蛋白质生物学中这类 emergent representations(涌现表征)。
Speaker 328:40 - 29:15
And then we can use it to make predictions of atomic resolution protein structure, and we can use it to and it's really fast. So it's blazing fast. So it's kind of you know, illustrating this Pareto optimal frontier of kind of speed and accuracy in structure prediction. And so this allows us to kind of characterize, you know, really vast kind of stretches of the protein universe. So we folded over 1,100,000,000 proteins and predicted their structures and identified kind of features connecting all of them through mechanistic interpretability.
Speaker 328:40 - 29:15
然后我们可以用它来预测原子分辨率的蛋白质结构,而且速度非常快。真的就是快得惊人。所以它某种程度上展示了结构预测中速度与准确性之间的 Pareto 最优前沿。这样一来,我们就能够刻画蛋白质宇宙中极其广阔的范围。我们对超过 1,100,000,000 个蛋白质进行了折叠并预测了它们的结构,还通过 mechanistic interpretability(机制可解释性)识别出了把它们彼此连接起来的一些特征。
Speaker 329:15 - 29:56
But I think the thing that I thought was most exciting about this model is this really general model of kind of protein biology. And so you can use it as a world model. You can actually really start to search the space of the world model to design new proteins. And it's really hitting state of the art across pretty much every structure prediction benchmark, and especially on protein protein interactions and protein antibody interactions, which is really critical for therapeutic design. And so what we found is you can actually now use the model to design proteins and to design actually single chain antibodies.
Speaker 329:15 - 29:56
但我认为这个模型最令人兴奋的一点,是它实际上是一个非常通用的蛋白质生物学模型。所以你可以把它当作一个 world model(世界模型)来使用。你真的可以开始搜索这个 world model 的空间,去设计新的蛋白质。而且它在几乎所有结构预测 benchmark(基准测试)上都达到了 state of the art(当前最佳水平),尤其是在 protein-protein interactions(蛋白质-蛋白质相互作用)和 protein-antibody interactions(蛋白质-抗体相互作用)方面,而这对治疗设计至关重要。所以我们发现,现在实际上已经可以用这个模型来设计蛋白质,甚至设计 single chain antibodies(单链抗体)。
Speaker 329:57 - 30:40
And so you can do all of this digitally and then, you know, really in a small number of experimental trials, basically like a 96 well plate, you know, select from hundreds of thousands of trajectories digitally, actually synthesize, you know, 96 proteins, test them in the lab in a really kind of short, easy experimental cycle. And we found nanomolar binders there. And so, you know, that's really the level for therapeutic activity. So it's really, I think, showing that you can have these kind of general purpose models that can you know, we didn't design a model for antibodies. We didn't design a model to, you know, to be able to bind one particular target.
Speaker 329:57 - 30:40
所以这一切都可以以数字化方式完成,然后你只需要非常少量的实验轮次,基本上比如一块 96 well plate(96 孔板),就可以先在数字空间里从数十万条 trajectory(轨迹)中进行筛选,真正合成出 96 个蛋白质,再在实验室里测试它们,形成一个非常短、非常简单的实验循环。我们在那里找到了 nanomolar binders(纳摩尔级结合物)。而这基本上就是具备治疗活性的水平。所以我认为,这确实说明你可以拥有这种通用模型——我们并没有专门为抗体设计一个模型,也没有专门设计一个模型去结合某一个特定靶点。
Speaker 330:40 - 31:16
You know, we just designed a model that could understand proteins, and you kind of get protein design as an emergent property. And then I also think it illustrates this kind of the power of open science and open source because, you know, we release this as basically an open discovery engine. And so really anyone can build on it. And so it takes what are these really intensive laboratory experiments where, you know, you have to screen through hundreds of thousands or millions of antibodies and high throughput screens in the lab. And, you know, you can really just kind of spin up an instance and compute and now, you know, be able to generate antibodies.
Speaker 330:40 - 31:16
你知道,我们只是设计了一个能够理解蛋白质的模型,而蛋白质设计就像是一种涌现属性自然出现了。我还认为,这也说明了 open science(开放科学)和 open source(开源)的力量,因为我们发布它时,本质上就是把它作为一个开放的 discovery engine(发现引擎)。所以实际上任何人都可以在它之上继续构建。这样一来,那些原本非常密集的实验室实验——你得在实验室里通过 high-throughput screens(高通量筛选)从数十万甚至数百万个抗体中进行筛选——现在你其实只需要启动一个实例、投入算力,就能够开始生成抗体。
Speaker 231:17 - 31:34
You should say more about sort of like we took that data when we looked at an antibody screen, and then we validated we looked at PDL in cells. And then we looked at it under the cryo EM and sort of how all that complemented validated what you were seeing in the models.
Speaker 231:17 - 31:34
你应该再多讲一点,比如说我们在观察一次抗体筛选时是如何利用那些数据的,然后我们做了验证,查看了细胞中的 PDL,接着又在 cryo EM(冷冻电镜)下进行了观察,以及这一整套流程是如何相互补充、如何验证你在模型里看到的结果的。
Speaker 331:34 - 31:52
That's right. Yeah. So I mean, I think it's really critical, you know, to actually go and characterize these molecules in the lab. It's you know, we have a structural biology center here. We have incredibly powerful cryo EM microscopes.
Speaker 331:34 - 31:52
没错,是的。所以我的意思是,我认为真正到实验室里去表征这些分子是非常关键的。我们这里有一个 structural biology center(结构生物学中心),也有极其强大的 cryo EM(冷冻电镜)显微镜。
Speaker 331:52 - 32:05
And so we're really able to kind of look at these proteins biophysically and functionally. And so, you know, we designed proteins for several therapeutically relevant targets, and we're able to confirm their function and It's some
Speaker 331:52 - 32:05
所以我们确实能够从生物物理和功能两个层面来观察这些蛋白质。我们针对几个与治疗相关的靶点设计了蛋白质,并且能够确认它们的功能,而且当它发挥预期作用时,这件事本身就很
Speaker 232:05 - 32:08
pretty light when it works the way it's supposed to.
Speaker 232:05 - 32:08
让人眼前一亮。
Speaker 332:08 - 32:15
Yeah, it's very amazing. Able to look at the structure So you can see atomic resolution kind of at the binding interfaces. Correct.
Speaker 332:08 - 32:15
是的,这非常惊人。它能够查看这种结构,所以你可以在结合界面处看到某种原子级分辨率。没错。
Speaker 532:15 - 32:41
I know a lot of your work is really focused on basic research and kind of building out the fundamentals. If I look at actual translation into drugs or drug development, often a clinical trial will be fifteen years. It'll cost $1,500,000,000 About 50,000,000 of that often is the molecule in preclinical work, and it's a few years of work. And then the other 1,450,000,000 and decade plus is actually the drug development side of it. A lot of that seems to be gated on some regulatory issues, some of it's recruitment, it's a variety of things.
Speaker 532:15 - 32:41
我知道你们很多工作确实都非常聚焦于基础研究,以及某种意义上搭建这些基础。如果我看真正向药物或 drug development(药物开发)的转化,临床试验往往要十五年,成本大约是 1,500,000,000 美元。其中通常大约 50,000,000 美元花在 molecule(分子)的临床前工作上,也就是几年的工作量。然后另外那 1,450,000,000 美元和十多年时间,实际上是在药物开发这一侧。这里面很多似乎都受制于一些监管问题,有些是招募问题,还有各种各样别的因素。
Speaker 532:41 - 32:56
But a lot of it also has to do with the failure of drugs and trials around things like absorption or toxicity or things like that. Have you considered at all tackling that other chain of sort of molecular design and thinking, or is the primary focus more on the basic biology and sort of the initial sort of molecules?
Speaker 532:41 - 32:56
但其中很大一部分也和药物及试验失败有关,比如吸收、毒性之类的问题。你有没有考虑过去解决那条偏向 molecular design(分子设计)和相关思考的链条,还是说你们的主要重点更多仍然放在基础生物学,以及最初那类 molecule(分子)上?
Speaker 232:56 - 33:06
I mean, at least my hope in building this like comprehensive model of how, you know, cells work is actually also being able to predict off target effects.
Speaker 232:56 - 33:06
我的意思是,至少对我来说,我希望建立这样一个关于细胞如何运作的综合模型,实际上也能用来预测 off-target effects(脱靶效应)。
Speaker 533:06 - 33:06
I think
Speaker 533:06 - 33:06
我觉得
Speaker 233:06 - 33:07
you can do some of
Speaker 233:06 - 33:07
你实际上可以做其中一些
Speaker 533:07 - 33:07
that
Speaker 533:07 - 33:07
那些事
Speaker 233:07 - 33:58
actually with biological models. Because right now some of the off target effects are we just didn't know, you know, your kidney cell also expressed this receptor. And then when we test it in human, like we see it happening and we see renal toxicity. And so being in If you have a single cell atlas that looks at all the different cell types, some of which actually were not predicted before we modeled them, you can start looking at which cells actually do have receptors for the target you thought you were exclusively targeting and be able to predict some of these downstream effects before we get into the human trials. And I think that that's actually one of the more exciting applications of like a transcriptomic model to understand actually how the different cells will react when you intervene and do something.
Speaker 233:07 - 33:58
实际上可以借助 biological models(生物学模型)来完成。因为现在有些 off-target effects(脱靶效应)是因为我们原本根本不知道——比如,你的 kidney cell(肾细胞)其实也表达这个 receptor(受体)。然后当我们在人类身上测试时,就会看到这种情况发生,并且看到 renal toxicity(肾毒性)。所以,如果你有一个 single-cell atlas(单细胞图谱)去观察所有不同的 cell types(细胞类型),其中有一些在我们建模之前其实并没有被预测到,你就可以开始查看,究竟哪些细胞实际上也拥有你原本以为只在目标对象上才有的那个 target(靶点)对应的 receptor(受体),并能够在进入人体试验之前,预测其中一些下游效应。我认为,这实际上是 transcriptomic model(转录组模型)一个更令人兴奋的应用:真正理解当你进行干预、做出某种操作时,不同细胞会如何反应。
Speaker 234:01 - 35:08
But I think when you think about delivery mechanisms and patient care, that's where you start having to be creative about when you ask like, what disease do you want to care first? There are certain diseases that will be easier to like deliver a therapeutic to, or the risk reward makes more sense. And, you know, I think we were all inspired by baby KJ, I think last year now, when the team at CHOP was able to deliver a CRISPR therapeutic to edit a mutation that he had that would have inevitably led him to significant neurotoxicity and altered his life. But we were able to, that disease was very carefully chosen because we needed to target his liver cells and if we could easily deliver a product that would work in his liver. And I think that's when the creativity, the wherewithal to choose the right applications can help us unlock the first applications.
Speaker 234:01 - 35:08
但我认为,当你考虑 delivery mechanisms(递送机制)和 patient care(患者照护)时,你就必须开始更有创造性地思考:当你问“你最先想治疗什么疾病”时,该怎么回答。有些疾病会更容易把 therapeutic(疗法)递送进去,或者说它的风险收益比更合理。你知道,我想我们去年都受到了 baby KJ 的启发,当时 CHOP 的团队成功递送了一种 CRISPR therapeutic(CRISPR 疗法),去编辑他所携带的一个突变;那个突变原本几乎注定会导致严重的 neurotoxicity(神经毒性),并改变他的人生。但我们之所以能做到,是因为那个疾病经过了非常谨慎的选择:我们需要靶向他的 liver cells(肝细胞),而且如果我们能够比较容易地把一种产品递送到他的肝脏并发挥作用。这也是我认为创造力以及选择正确应用场景的判断力,能够帮助我们解锁第一批应用的时候。
Speaker 335:08 - 35:38
Maybe something just to add to that also, you know, because, I mean, kind of you described the conventional, you know, drug development process. Right? And I think, you know, these kind of tools have the potential to have a lot of impact on that process. But, you know, what's interesting is to really start to think about kind of the new paradigms that can open up. And, you know, what does it mean if if you can you know, the barrier to develop a drug, to design a molecule, you know, to kind of get through all of those stages is so much lower.
Speaker 335:08 - 35:38
也许还可以再补充一点,因为你的描述基本上是传统的药物开发流程,对吧?我觉得,这类工具当然很有潜力对这个流程产生巨大影响。但真正有意思的是,开始去思考它会开启哪些新的 paradigms(范式)。如果开发一种药物、设计一个分子、再一路推进 through all of those stages 的门槛都大幅降低,这到底意味着什么?
Speaker 335:38 - 35:52
And so you have programmable biology, and you can, really start to create a medicine for every individual patient. I think that has enormous implications for how we do drug development and what the future of medicine looks like.
Speaker 335:38 - 35:52
这样一来,biology(生物学)就变得 programmable(可编程)了,你也真的可以开始为每一位个体患者创造专属药物。我认为,这对我们如何做药物开发,以及未来的医疗会是什么样子,都有极其深远的影响。
Speaker 535:52 - 35:54
Mhmm. It'll be an exciting day when the FDA accepts like a
Speaker 535:52 - 35:54
嗯。如果有一天 FDA 接受比如
Speaker 435:54 - 36:20
virtual clinical trial for the phase one or something, or, know, it's based on some person's view of that person. Yeah. Or even short of that, like thinking about the specific like mechanisms where you see this acceleration, like I imagine if people feel like they can predict impact in kidney cells or have a stronger perspective on tox because they have this broader understanding, they'll be willing to try many more programs.
Speaker 435:54 - 36:20
phase one 的 virtual clinical trial(虚拟临床试验)之类的,那会是非常令人兴奋的一天;或者说,它是基于某个人对该患者情况的判断。是的。甚至退一步讲,只看你会在哪些具体机制上看到这种加速——我能想象,如果人们觉得自己可以预测对 kidney cells(肾细胞)的影响,或者因为有了更广泛的理解而对 tox(毒性)有更强的判断,他们就会愿意去尝试更多项目。
Speaker 136:20 - 37:08
Yeah. The recruitment could also change. We have this program, Rare as one, and the basic idea is that a lot of people focus on the most common diseases, but there's this long tail. And the economics don't quite work out for companies to focus on those diseases, but if you can make it so that the groups of patients can kind of come together and organize and say, hey, we would take an experimental drug on this, then it actually, because of the cost that you're talking about and how that's a huge amount of the overall cost, if you can flip that, then it actually makes it so the economics make a lot more sense to then if you can generate something more easily and you can pair it with a group of people. I think one of the interesting things from science and engineering is that often can hit your head against the wall on the common problems, and in this case diseases.
Speaker 136:20 - 37:08
是的,recruitment(受试者招募)也可能会改变。我们有一个项目叫 Rare as one,基本想法是,很多人都聚焦于最常见的疾病,但其实还存在一个 long tail(长尾)。从经济性上看,公司并不太划算去关注那些疾病;但如果你能让这些患者群体组织起来,说,嘿,我们愿意尝试这方面的 experimental drug(实验性药物),那么结合你刚才提到的成本问题——而那又占了总体成本中的很大一部分——如果你能把这点扭转过来,那么在经济上就会合理得多:你更容易生成某种候选方案,又能把它和一群患者匹配起来。我觉得 science and engineering(科学与工程)里一个有意思的现象是,人们常常会在那些最常见的问题上反复碰壁,这里对应的就是常见疾病。
Speaker 137:09 - 37:34
But a lot of times you learn a lot more about a system from finding some kind of rare or weird side thing that's happening in each case. I don't know. I think that that's always been kind of an interesting part of this that actually connects pretty well to this because now you're gonna be able to enable a long tail of new kind of ideas to get tried and enable them to potentially get tested more easily.
Speaker 137:09 - 37:34
但很多时候,真正让你更理解一个系统的,反而是在每种情况下发现某种罕见的、奇特的边缘现象。我不知道,我一直觉得这是这件事里很有意思的一部分,而且它和这里其实联系得非常紧密,因为现在你将能够让 long tail(长尾)里的各种新想法有机会被尝试,也让它们更有可能更容易地被测试。
Speaker 237:34 - 37:52
Yeah. That's a really good point on rare. In our rare disease cohorts, first of all, they're incredibly inspiring and powerful. But patient groups are self organizing patient registries, natural history registries, biobanks. They're organizing their own clinical trials.
Speaker 237:34 - 37:52
是的,你关于 rare(罕见病)的观点特别好。在我们的 rare disease(罕见病)队列里,首先,这些群体本身就极具启发性,而且非常有力量。patient groups(患者群体)正在自发组织 patient registries(患者登记库)、natural history registries(自然病程登记库)、biobanks(生物样本库)。
Speaker 237:52 - 38:14
There's gene therapy that one disease group has moved forward over the course of like, I want to say like three to five years rather than decades. And the speed is so fast because the patients themselves have organized the resources that a scientist or a clinician might need. And it's incredible.
Speaker 237:52 - 38:14
他们甚至在组织自己的 clinical trials(临床试验)。有一种 gene therapy(基因疗法),某个疾病群体大概只用了三到五年就把它推进了下去,而不是几十年。之所以速度这么快,是因为患者自己就把科学家或临床医生可能需要的资源都组织好了。这非常了不起。
Speaker 138:14 - 38:33
But I think to some degree you're going to need something like this because there are gonna be many more new things that can get created. But that doesn't mean that for, like, the general population that you're not gonna want the same level of vetting that we've had historically. But making it so that people who wanna be on more of the frontier have the ability to do that is I think also going be pretty helpful.
Speaker 138:14 - 38:33
但我认为,在某种程度上,你会需要类似这样的机制,因为未来能被创造出来的新事物会多得多。不过,这并不意味着对于普通大众,你不希望保留我们历史上一贯采用的那种同等程度的审查与验证。只是如果能让那些想站在更前沿的人也有能力这么做,我觉得也会相当有帮助。
Speaker 538:33 - 38:41
Yeah, letting people opt in to be part of trials I think is one of the big shifts that is starting to happen but could really help accelerate biology in general.
Speaker 538:33 - 38:41
是的,让人们可以自愿选择加入试验,我认为这是正在开始发生的一项重大转变,而且它确实能帮助整体加速 biology 的发展。
Speaker 438:41 - 39:07
All three of you have mentioned at different points like the power of open ecosystems in such a large space. Like I think some of that logic around open source and the breadth or diversity of data collection that you guys were describing, it should also apply in the like language model world and the multimodal AI world. Do you think that's right? Does any of the work you're doing here change how you think about AI and meta?
Speaker 438:41 - 39:07
你们三位在不同地方都提到过,在这么大的一个领域里,开放生态系统的力量。比如我觉得,你们刚才描述的那些关于 open source(开源)以及数据收集广度或多样性的逻辑,也应该适用于 language model(语言模型)和 multimodal AI(多模态 AI)世界。你们觉得这是对的吗?你们在这里做的这些工作,会不会改变你们对 AI 和 meta 的看法?
Speaker 539:07 - 39:08
I mean, I think
Speaker 539:07 - 39:08
我的意思是,我觉得
Speaker 139:08 - 39:41
it's sort of a similar philosophy overall. And, you know, Priscilla was talking about this, that a lot of our focus is building tools that empower individuals to do things. And that's sort of a common theme across a lot of the things that that I work on is just kind of putting the technology in individuals' hands. We don't believe in this, like, very centralized future where there should be a small number of institutions that that basically are are advancing all of this stuff. Our vision is not that there's gonna be, like, some central superintelligence that solves all of science.
Speaker 139:08 - 39:41
整体上这算是一种相似的理念。而且,Priscilla 刚才也谈到了这一点:我们的很多重点都在于打造工具,赋能个人去做事情。这也是我做的很多事情中的一个共同主题,就是把技术交到个人手中。我们并不相信那种高度中心化的未来——不认为应该由少数机构来推动这一切的发展。我们的愿景不是说,未来会出现某种中央 superintelligence(超级智能)来解决全部科学问题。
Speaker 139:41 - 40:06
I think, people are really important, and I think will be more important in the future. And giving people more tools to be more productive is gonna be a critical part of any kind of positive future that both and that's how progress has always been made historically. Right? It's not through centralization. It's through empowering individuals to try things that are somewhat out of the mainstream that other people didn't think were good ideas because they thought they were good ideas that already had been done.
Speaker 139:41 - 40:06
我认为,人是非常重要的,而且未来会变得更加重要。给人们更多工具,让他们更高效地产出,将会是任何积极未来中的关键组成部分,而且历史上的进步也一直都是这么实现的,对吧?不是通过中心化,而是通过赋能个人去尝试一些并不主流的东西——那些别人觉得不是好主意、因为他们以为好主意早就已经被做过了的事情。
Speaker 140:06 - 40:32
So I think that that's very central to the whole ethos of I mean, to some degree, it's like why you create something like social media, right, to give people a voice. It's, you know, I think a lot of the the stuff that we that I care about in terms of empowering people with individual AI. Open source is one instantiation of it. It's not the only way to do it. It certainly is one way that you basically are saying we're going to take this technology and put it in everyone's hands.
Speaker 140:06 - 40:32
所以我认为,这其实是整个 ethos(理念)中非常核心的一点。某种程度上,这也是你为什么会创造 social media(社交媒体)这样的东西,对吧?是为了给人们发声的机会。还有,我认为,就我所关心的那些用 individual AI(个人 AI)赋能人们的事情来说,open source(开源)是其中一种实现方式。它不是唯一的方式,但它当然是一种明确表达“我们要把这项技术交到每个人手中”的方式。
Speaker 140:32 - 41:07
In terms of science, I think it really makes sense, and we're deeply committed to open source. There are obviously interesting considerations on this that are important too because there's a lot of considerations around biosafety and things like that that we're going to need to balance and think through how to handle. But I think overall, this is very deep in the ethos of the work that we're doing both at Biohub and probably a theme for a lot of the stuff that I do is just we believe that a positive future is one where you build a technology as a tool, you put it in individuals' hands, and that's kind of how society makes progress.
Speaker 140:32 - 41:07
就科学而言,我认为这确实很有道理,而且我们对 open source(开源)有非常深的承诺。当然,这里面也显然有一些值得认真考虑的重要问题,因为还涉及很多关于 biosafety(生物安全)之类的考量,我们需要在其中做平衡,并思考该如何处理。但总体来说,这非常深地植根于我们正在做的工作的 ethos(理念)之中——无论是在 Biohub,还是大概在我做的很多事情里都一样:我们相信,积极的未来是这样的——你把技术构建成一种工具,把它交到个人手中,而社会也正是这样取得进步的。
Speaker 441:08 - 41:24
You have this, I think, incredibly ambitious mission at Biohub. And yet, you know, the AI scientists that work here could also go work in commercial enterprises. How do you think about the talent and, like, how to bring people to Biohub?
Speaker 441:08 - 41:24
我觉得,Biohub 有一个极其宏大的使命。不过你也知道,在这里工作的 AI scientists 其实也完全可以去商业公司工作。你怎么看待人才这个问题,以及该如何把人吸引到 Biohub 来?
Speaker 141:25 - 42:05
I mean, where do you to start? I think, you know, yeah, I mean, it's a very hot market for AI researchers. But I think that part of what that means is that there's a lot of demand, and they're very in demand and can work on the things that they wanna work on. And I think this gets back to this point again about frontier AI and frontier biology. So yeah, I mean, think the AI researchers who work here could go work on language models or things at any of the main labs, But those labs don't have the frontier biology part attached to it.
Speaker 141:25 - 42:05
我的意思是,这该从哪里说起呢?我觉得,你知道,没错,现在对 AI researchers 来说确实是个非常火热的市场。但我认为,这其中一部分意味着市场需求非常大,他们很抢手,也因此能去做自己真正想做的事情。我想这又回到了刚才那个关于 frontier AI 和 frontier biology 的点。所以,是的,我的意思是,在这里工作的 AI researchers 当然也可以去任何一家主要实验室做 language models 或类似的工作,但那些实验室并不具备与之结合的 frontier biology 这一部分。
Speaker 142:05 - 42:27
So I think that there's also a just very large mission component of this, which is there's an ability to do this unique work here that you just can't really do at the other places. So if that's what your focus is, then this then I don't actually think that there's any other organization in the world that's doing both the frontier biology and the frontier AI.
Speaker 142:05 - 42:27
所以我认为,这里面还有一个非常强的 mission(使命)层面的因素,那就是你能在这里做一种独特的工作,而这种工作在其他地方基本上做不了。如果这正是你的关注重点,那么我其实不认为世界上还有任何别的组织,能同时做 frontier biology 和 frontier AI。
Speaker 242:27 - 42:28
Yeah. Why are you here, Alex?
Speaker 242:27 - 42:28
对。Alex,你为什么会在这里?
Speaker 342:28 - 42:37
I mean, I think it's it's really simple. Yeah. Our our mission is to take care of prevent disease. And and I think, you know, there's it's it's just such a
Speaker 342:28 - 42:37
我的意思是,我觉得这其实非常简单。对。我们的使命是照顾人们、预防疾病。而且我觉得,你知道,这真的是一个如此——
Speaker 442:37 - 42:39
powerful with a straight face in a less than 100 timeline.
Speaker 442:37 - 42:39
——强有力的使命,而且你完全可以一本正经地说,在不到 100 年的时间线内实现它。
Speaker 242:40 - 42:42
Serious now. There's no more.
Speaker 242:40 - 42:42
现在认真点。没了。
Speaker 142:42 - 42:43
That's yeah.
Speaker 142:42 - 42:43
对,就是这样。
Speaker 342:44 - 42:53
Yeah. It's it's a really powerful mission. And I I think, you know, you yeah. I mean, it's it's just, you know, scientists, I think, are
Speaker 342:44 - 42:53
是的。这确实是一个非常有力量的使命。我觉得,你知道,是的。我的意思是,我想科学家本身就是
Speaker 442:53 - 42:54
very motivated by science. Yes.
Speaker 442:53 - 42:54
被科学强烈驱动的。是的。
Speaker 342:54 - 43:19
Yeah. It's it's something people are deeply motivated by. And I think, you know, we're at this moment in time where that actually seems like something that can be achieved. And I think, you know, we're building a really unique place where where we're tackling that problem. And, you know, we have the resources, I think, kind of the right things to actually really go after that and do that.
Speaker 342:54 - 43:19
是的。这是人们会发自内心、深深为之驱动的事情。而且我觉得,你知道,我们正处在这样一个时间点:这件事现在看起来实际上是可以实现的。我认为,我们正在打造一个非常独特的地方,在这里我们正面应对那个问题。而且,你知道,我觉得我们拥有资源,也拥有某种意义上真正去全力攻克并完成这件事所需要的正确条件。
Speaker 443:20 - 43:34
Yeah. I mean, that resonates with me as somebody who, you know, talks to and hires a lot of research scientists. They want to know if you have the data, if you have the tools, if you have the compute, if you have the talent, and then what the mission is. And so I actually think that's super competitive.
Speaker 443:20 - 43:34
是的。我的意思是,作为一个经常与 research scientists(研究科学家)交流并招聘他们的人,这一点我很有共鸣。他们想知道你有没有 data(数据),有没有 tools(工具),有没有 compute(算力),有没有 talent(人才),然后才会看 mission(使命)是什么。所以我实际上认为,这一点非常有竞争力。
Speaker 143:34 - 44:08
The other thing is that you don't need a very large team. Right? So I think it's like an interesting thing about the world is that people care about different missions, and that's good. I think that's part of the whole I mean, part of why building these tools and giving people the ability to explore what they care about, whether it's across science or just across everything, is such a powerful way to make progress in society is that people care about different things. And in order to make progress in AI, you don't need many, many hundreds of AI researchers or thousands or anything like that.
Speaker 143:34 - 44:08
另一点是,你并不需要一个特别庞大的团队,对吧?所以我觉得,关于这个世界有一个很有意思的地方,就是人们关心不同的使命,而这很好。我认为这也是整体的一部分——我是说,之所以构建这些工具,并赋予人们探索自己所关心之事的能力,不管是在科学领域还是在所有其他领域,之所以会成为推动社会进步的一种强大方式,正是因为人们关心的东西各不相同。而为了在 AI 上取得进展,你并不需要成百上千的 AI researchers(AI 研究人员),也不需要几千人之类的规模。
Speaker 144:08 - 44:27
I think you can really make progress with a very strong group of a dozen or a couple dozen people. And, yeah, mean, finding people who care about this mission is not a particularly hard thing. I mean, this is like a super important thing in the world. So I think that that's yeah, it's just kind of a cool thing about the world is that people obviously are drawn to different missions.
Speaker 144:08 - 44:27
我认为,真正取得进展,靠一个非常强的十几人或二十来人的团队就可以了。然后,是的,我的意思是,去找到那些关心这个 mission(使命)的人,并不是一件特别困难的事。我的意思是,这在世界上是一件极其重要的事情。所以我觉得,这也算是这个世界一个很酷的地方:人们显然会被不同的使命所吸引。
Speaker 444:28 - 45:01
So I think the simplest mental models that folks have, even if they're paying attention to the space, are essentially like, okay, you know, structured prediction models for proteins and protein protein interaction models. And then, so there's this one piece, which is fundamental understanding. And then there's this like theory of someday we're just going to be able to like zero shot things into either the clinic or the clinic with much, much better hit rate. What needs to happen for us to go from ESM Fold two to this other piece? Is that feasible?
Speaker 444:28 - 45:01
所以我觉得,即使是那些在关注这个领域的人,他们脑中最简单的 mental models(心智模型)基本上也是这样的:好吧,你知道,就是针对 protein(蛋白质)的 structured prediction models(结构化预测模型),以及 protein-protein interaction models(蛋白质-蛋白质相互作用模型)。然后,这里有一个部分,是基础理解。再然后,还有这样一种理论:总有一天,我们将能够以 zero-shot(零样本)的方式,把东西直接推进到 clinic(临床),或者至少推进到临床阶段,而且命中率会高得多得多。从 ESM Fold two 走到后面这一部分,需要发生什么?这可行吗?
Speaker 345:01 - 45:19
I think that's a great question. I mean, I would say that I'm really optimistic on that. So I think, you know, on the one hand, you know, these are problems that historically, you know, people could spend kind of an entire career working on. Like, how do you how do you figure out how to effectively optimize a drug? How do you get it, you know, get it through preclinical?
Speaker 345:01 - 45:19
我觉得这是个很好的问题。我的意思是,我会说,我对此是非常乐观的。所以我觉得,你知道,一方面,这些问题在历史上一直都是那种人们可能花上整个职业生涯去研究的问题。比如说,你要怎样才能弄清楚如何有效地优化一种 drug(药物)?你又怎样才能让它通过 preclinical(临床前研究)?
Speaker 345:19 - 45:44
How do you do the early safety? I think that, you know, when you have a new scientific paradigm, kind of, you know, questions that were once hard kind of become simplified through the new paradigm. And so I'm very optimistic that kind of many of these core problems will be solved kind of in an emergent way through these models. Mhmm. And I think one great example of that is is toxicity.
Speaker 345:19 - 45:44
你们是如何在早期做 safety(安全性)工作的?我觉得,你知道,当出现一种新的 scientific paradigm(科学范式)时,那些曾经很难的问题,某种程度上会在这个新范式下变得更容易。所以我非常乐观地认为,其中许多核心问题会通过这些模型,以一种 emergent(涌现)的方式得到解决。嗯。我觉得一个很好的例子就是 toxicity(毒性)。
Speaker 345:44 - 46:13
Whereas if if you can kind of really digitally digitally kind of simulate everything and be able to predict, you know, where a drug is going to distribute and bind across the human body. You know, like you kind of have the beginning of a solution to that kind of problem. So I think that once you have these kind of accurate representations at the molecular level, we're going to start to see really rapid progress on a lot of these core problems.
Speaker 345:44 - 46:13
相比之下,如果你真的能够以数字方式去模拟一切,并且能够预测一种药物会如何在人体内分布,以及它会在全身哪些地方发生 bind(结合),那你就等于已经开始拥有解决这类问题的方案。所以我认为,一旦我们拥有这种分子层面的精确表征,我们就会开始看到这些核心问题中的很多都取得非常快速的进展。
Speaker 446:13 - 46:19
What is the most exciting use or experimentation with the models you've seen in the last week since release?
Speaker 446:13 - 46:19
自从发布以来,在过去这一周里,你看到的最令人兴奋的模型用途或实验是什么?
Speaker 346:19 - 46:53
Yeah. I mean, it's just been great to kind of see it get integrated in all kinds of things. I think one of the really interesting things that we've been seeing is people kind of connecting it with AgenTex systems to just kind of do automated design and kind of just automate that whole process. So it's really, I think, another example of how you can kind of see bringing together AgenTeq and Frontier AI with, you know, the ability to have a world model for biology and actually reason about biology and, you know, really kind of start to automate the entire design process.
Speaker 346:19 - 46:53
是的。我的意思是,看到它被集成进各种各样的东西里,真的很棒。我觉得我们看到的一件非常有趣的事,是人们在把它和 AgenTex systems 连接起来,用来做 automated design(自动化设计),基本上就是把整个流程自动化。所以我认为,这又是一个例子,说明你可以把 AgenTeq 和 Frontier AI 结合起来,再加上拥有一个面向 biology(生物学)的 world model(世界模型)、并且真正对 biology 进行推理的能力,从而真正开始把整个设计流程自动化。
Speaker 446:53 - 47:25
How do you decide what the next step in the research agenda is? It's like world model for biology, and then I could I'm just gonna be very coarse here. Like, I could scale it up, I could add more data, I could add, like adding data is a nontrivial thing in terms of new methods and domains. Like what is, do you take input from the larger ecosystem about, you know, how people are using it and what would make it more useful? Or is it really like we we understand, like, the next step of structures or coverage that we're looking for?
Speaker 446:53 - 47:25
你们是如何决定 research agenda(研究议程)里的下一步的?比如说,现在是 biology 的 world model,然后我这里说得很粗略:我可以把它 scale up(扩展规模),可以加更多 data(数据),而“加数据”这件事本身在新方法和新领域上也并不简单。那你们会不会从更大的 ecosystem(生态系统)那里获取输入,看看人们是怎么用它的,以及怎样才能让它更有用?还是说,其实更像是你们自己已经清楚地知道,下一步要追求的是哪些 structure(结构)或 coverage(覆盖范围)?
Speaker 347:25 - 47:32
I mean, I think there's two things. So, like, we have a view on kind of the next big challenge, which I think is, you know, the the virtual cell
Speaker 347:25 - 47:32
我的意思是,我觉得有两件事。也就是说,我们对于下一个重大挑战是有一个判断的,我认为那就是 virtual cell(虚拟细胞)
Speaker 247:32 - 47:32
Mhmm.
Speaker 247:32 - 47:32
嗯。
Speaker 347:32 - 47:38
And, you know, really being able to kind of ladder up the hierarchy of biological complexity to the cell.
Speaker 347:32 - 47:38
以及,真正能够沿着 biological complexity(生物复杂性)的层级一路向上推进,到达 cell(细胞)这一层面。
Speaker 447:39 - 47:44
Sorry. Very basic question. Yes. Virtual cell model. Like, what is the input and output I should expect?
Speaker 447:39 - 47:44
抱歉,一个很基础的问题。是的。virtual cell model(虚拟细胞模型)的话,我应该预期它的输入和输出分别是什么?
Speaker 347:44 - 48:13
Yeah. I mean, I think there's different views on that. But I think kind of what you ultimately want is a system that can really model each of the levels of complexity. So, you know, the the proteomic layer, the genetic layer, the transcriptomic layer, and connect that to the phenotype. And you need enough generality so that you can ask the model questions about a new intervention in a context that it hasn't been trained on and and kind of get an answer from it.
Speaker 347:44 - 48:13
对,我的意思是,我觉得对此会有不同看法。但我认为,你最终真正想要的是一个能够真实建模各个复杂性层级的系统。比如说,proteomic layer(蛋白质组层)、genetic layer(遗传层)、transcriptomic layer(转录组层),并把这些和 phenotype(表型)连接起来。而且你还需要足够的通用性,这样你才能在一个模型未曾训练过的情境中,就某种新的 intervention(干预)向它提问,并从中得到答案。
Speaker 348:13 - 48:26
And, you know, the gap that we we need to close as a field is being able to really make those predictions that can generalize. So that's gonna require an enormous effort to generate data.
Speaker 348:13 - 48:26
而且,你知道,作为一个领域,我们需要弥合的差距,是要真正能够做出那些可以泛化的预测。所以这将需要付出极其巨大的努力来生成数据。
Speaker 148:27 - 48:42
Yeah. And then, I mean, in terms of what you decide to do next, I think this is like you know, a pretty normal process of constraint management. Right? I mean, it's like like, I think every lab in every field across the world probably feels compute constrained. I think that that's probably true here too.
Speaker 148:27 - 48:42
对。然后,我的意思是,就你接下来决定做什么而言,我觉得这就像是一个相当正常的约束管理过程。对吧?我的意思是,我觉得全世界每个领域里的每个实验室,大概都会觉得自己受限于 compute(算力)。我想这里大概也是这样。
Speaker 148:42 - 48:52
Right? It's like so, I mean, I know, like, you know, there's always questions. It's like, okay. Should we double down more on advancing the protein piece? Should we do more of the cellular stuff?
Speaker 148:42 - 48:52
对吧?就是说,我知道,你懂的,总会有这样的问题:好,那我们是不是应该更进一步加码推进 protein 这一块?我们是不是应该多做一些 cellular(细胞层面)的东西?
Speaker 148:52 - 49:35
I think those are kind of ongoing debates in terms of how you sequence that. And then, yeah, within that, there's kind of being at the Pareto frontier about how much you wanna train the different models in order to like, and and the size of the models is also dependent on the scale of the data that you have because you have yeah. For for obvious reasons. So, yeah, I mean, I think it's there's some of that as just where you wanna be on the curves and then normal constraints, but I think that this is, like, probably the same process that, like, any research organization goes through of, like, you wanna go in all these different directions, and you're just trying to constraint optimize and make enough progress to do world class work at one thing at a time while planting some seeds that can blossom over the next couple years as well. Yeah.
Speaker 148:52 - 49:35
我觉得这些在如何安排先后顺序上,都是持续存在的讨论。然后,对,在这之中,还有一个问题,就是要处在 Pareto frontier(帕累托前沿)上的什么位置——你愿意把不同模型训练到什么程度;而且模型的大小也取决于你拥有的数据规模,因为这个——对——原因很明显。所以,对,我觉得其中一部分只是你想落在这些曲线的什么位置,以及一些常规约束;但我认为,这大概也和任何研究组织都会经历的过程一样:你想同时朝很多不同方向推进,而你只是在努力做约束优化,在一次专注一件事的前提下取得足够的进展,做出 world class(世界一流)的工作,同时也播下一些种子,让它们在未来几年里开花结果。对。
Speaker 149:35 - 49:37
This has been the most dynamic
Speaker 149:35 - 49:37
这是我所见过最具动态变化的
Speaker 549:37 - 49:44
period of technology, at least, I've seen my career. Mean it's so exciting in terms of everything that's happening with AI. Every week there's something new that's changed.
Speaker 549:37 - 49:44
一段技术时期,至少在我的职业生涯里是这样。我的意思是,AI 方面正在发生的一切,实在太令人兴奋了。几乎每周都会有新的变化出现。
Speaker 249:44 - 49:46
Are you tired or invigorated?
Speaker 249:44 - 49:46
你是觉得累,还是觉得更受激励?
Speaker 549:46 - 49:52
I'm both. I everybody's feel in manic phase.
Speaker 549:46 - 49:52
我两种状态都有。我能感受到大家在 manic phase(躁狂阶段)里的那种感觉。
Speaker 149:52 - 49:55
Yes. It's a combination of invigorated and exhausting.
Speaker 149:52 - 49:55
对。那是一种既让人振奋、又让人筋疲力尽的组合。
Speaker 549:55 - 50:10
Yeah. It's wonderful. And so I guess, know, things are very unpredictable right now. It's really hard to know what's coming. We have this almost like early signs of exponentation on the model side with agentic flows that we're starting to see in really interesting ways.
Speaker 549:55 - 50:10
是啊。很奇妙。所以我想说,你知道,现在很多事情都非常不可预测。真的很难知道接下来会发生什么。我们在 model 这一侧已经看到了几乎像是指数式增长的早期信号,尤其是在 agentic flows(agent 驱动流程)方面,开始以非常有意思的方式显现出来。
Speaker 550:11 - 50:37
Models starting to help more and more with models, but that's still very very early days for that. If you're thinking back five years from now and you were to define what success was relative to your efforts, and I know things are very dynamic, things changed a lot. But you have this common thread of tooling for the Biohub. You have a common thread of empowering scientists at scale. You're looking back five years from now, is there a specific thing that you really want to make sure that you've accomplished or achieved or a primary goal?
Speaker 550:11 - 50:37
model 也开始越来越多地帮助构建 model,但这方面仍然还处在非常非常早期的阶段。如果把时间拉到五年后,再回头定义相对于你们当前努力而言,什么算是成功——我知道现在局势很动态,变化也很多——但你们有一条关于 Biohub tooling(工具体系)的共同主线,也有一条关于大规模赋能科学家的共同主线。如果五年后回头看,有没有一件你真的特别想确保自己已经完成、实现的具体事情,或者一个最核心的目标?
Speaker 450:37 - 50:37
Well, I mean, I think we
Speaker 450:37 - 50:37
嗯,我的意思是,我觉得我们
Speaker 150:37 - 51:06
have a pretty clear view of this hierarchical set of world models that we wanna build around biology. And the other part of that is that we wanna do the highest quality work in the world. Right? I mean, I and I think we're basically set up to do that between having a world class AI research team and this collection of of Biohubster world class life sciences research organizations. I think that that's, like, fundamentally a setup that no other organization in the world has.
Speaker 150:37 - 51:06
对要围绕 biology(生物学)构建的这套分层 world models(世界模型)有相当清晰的看法。另一部分是,我们想做出全球最高质量的工作。对吧?我是说,我也认为,凭借一支 world class(世界一流)的 AI 研究团队,再加上由 Biohub 旗下 world class 的生命科学研究机构组成的这个集合,我们基本上已经具备了做到这一点的条件。我觉得,这从根本上说是一种全球没有其他组织拥有的配置。
Speaker 151:07 - 51:34
But, you know, you can have a lot of great ingredients, and that doesn't guarantee that you succeed. And so, I mean, to me, five years from now looking back, I think, you know, I'm sure other labs or efforts will try to produce things that approximate what we're trying to do. And I just think that we should be able to do something that is meaningfully better and a unique intellectual contribution to the world. I think that that's kind of what you, whenever you do any kind of research, that's what you're trying to do. Right?
Speaker 151:07 - 51:34
但是,你知道,拥有很多很棒的要素,并不保证你一定会成功。所以对我来说,如果五年后回头看,我觉得,肯定会有其他实验室或其他团队尝试做出一些近似于我们想做之事的东西。而我认为,我们应该能够做出明显更好的成果,并且为世界带来一种独特的 intellectual contribution(智识贡献)。我想,这某种程度上就是你做任何研究时想要做到的事。对吧?
Speaker 151:34 - 51:54
So, yeah, so if we do that, think we'll all feel very good. I would also expect that at some point we'll just start seeing a lot more idea generation from the people using the models. But I have enough faith that that part will materialize that for me, it's more just about, like, making sure that we do world class work. And I think if we do, like, the rest almost will take care of itself.
Speaker 151:34 - 51:54
所以,是的,如果我们做到了这一点,我想我们大家都会感觉非常好。我也预计,到了某个时候,我们会开始从使用这些 model 的人那里看到更多得多的 idea generation(想法生成)。不过我对那一部分最终会自然出现这件事已经有足够的信心,所以对我来说,更多还是要确保我们做的是 world class 的工作。我觉得如果我们做到了,剩下的事情几乎会自行水到渠成。
Speaker 451:54 - 52:04
Very last question for you. Snapshot of it's mid twenty twenty six. What's the biggest update in your own thinking about Biohub or the domain from the last year?
Speaker 451:54 - 52:04
最后一个问题。假设现在是 2026 年年中,回看过去一年,你对 Biohub 或这个领域本身的思考,最大的更新是什么?
Speaker 152:05 - 52:29
Well, from the last year. I mean, you joined in the last year. I mean, I think the the biggest thing that that we basically rotated and and I think in the last year, we basically kinda formalized that Biohub is the main focus of our philanthropy. So I think this has been a very big shift. But Alex and the team coming in, I think, been interesting, not only because it's a world class group.
Speaker 152:05 - 52:29
嗯,就过去一年而言。我的意思是,你也是在过去一年加入的。我觉得,最大的变化基本上是我们的重心做了调整,而且在过去一年里,我们算是正式明确了:Biohub 是我们 philanthropy(慈善事业)的主要重点。所以我认为这是一个非常大的转变。而 Alex 和团队的加入,我觉得很有意思,不只是因为这是一支 world class(世界级)的团队。
Speaker 152:29 - 52:53
Right? I mean, you guys have worked together for a while. I think also, I mean, you talked about how stuff is changing so much in the field. I think one thing that's underrated is this is an extremely talented group of people who also know each other and work well together and are stable and good. And I think that that also is underestimated in terms of the compounding benefit of people being able to work well in a stable environment over time.
Speaker 152:29 - 52:53
对吧?我的意思是,你们已经一起合作了一段时间。我还觉得,正如你所说,这个领域里的很多东西变化非常快。我认为一个被低估的点是:这是一群极其有才华的人,他们彼此熟悉、合作顺畅,而且稳定、可靠。我觉得,人们也低估了这样一种复利式收益:大家能够在一个稳定的环境里长期高效协作,这本身会不断累积价值。
Speaker 152:55 - 53:41
So I think that that's a really important piece. But part of what we wanted to do was, prior to Alex leading the effort, the previous leaders of the Biohub were basically primarily biologists who were interested in technology. Now I think this is the point where we really flip that, where, I mean, obviously you have a background in biology as well, but you are primarily an AI researcher who has a background in biology. I think that that's a deep reflection on the way that we expect that this is going to drive more value in the future. So those are probably the biggest updates in the last year in terms of the work that we're doing.
Speaker 152:55 - 53:41
所以我觉得那是非常重要的一部分。但我们当时想做的一件事是,在 Alex 领导这项工作之前,Biohub 之前的领导者本质上主要是对技术感兴趣的生物学家。现在我觉得我们真正把这一点反过来了。也就是说,显然你也有 biology(生物学)背景,但你首先是一位有生物学背景的 AI researcher(AI 研究者)。我认为,这深刻反映了我们对未来价值驱动方式的判断:我们预计这种路径将在未来创造更多价值。所以,从我们正在做的工作来看,这大概就是过去一年里最大的几项更新。
Speaker 153:41 - 53:48
I mean, it's a new leader, not just the leader, but a team that I think has been is like really good.
Speaker 153:41 - 53:48
我的意思是,这是一个新的 leader(领导者),而且不只是领导者本身,还有一支我认为真的非常出色的团队。
Speaker 553:48 - 53:49
And then, yeah, I mean,
Speaker 553:48 - 53:49
然后,嗯,我的意思是,
Speaker 153:49 - 54:13
I think on the rest of the industry, it's like it's on track. I mean, I think, like, every it's it's kind of this crazy thing because, like, when you have an exponentially growing curve, I think the way that an exponential curve feels is it's growing so quickly that the the kind of emotional feeling is it can't possibly keep going. Mhmm. Right? Because, like, it's because it's just like but but, I mean, the nature of an exponential curve is it it doesn't just keep going.
Speaker 153:49 - 54:13
我觉得,就行业其余部分而言,一切都还在按轨道前进。我的意思是,我觉得,这件事有点疯狂,因为当你面对一条指数增长曲线时,指数曲线给人的感受是:它增长得太快了,以至于情绪上的直觉会觉得,这不可能继续下去。嗯哼。对吧?因为这就像是——但是,我的意思是,指数曲线的本质并不只是它会继续下去。
Speaker 154:13 - 54:30
It keeps accelerating. Right? Exponential growth is accelerating. So I think that that has all these emotions and psychology attached to it. But I think fundamentally when you look at the curve in the industry, the kind of fundamental thing is it is on track.
Speaker 154:13 - 54:30
它还会继续加速。对吧?指数增长本身就是在加速的。所以我觉得,这会牵动很多情绪和心理反应。但我认为,从根本上看,当你去看这个行业的曲线时,最核心的一点是:它仍然在按预期推进。
Speaker 154:30 - 54:51
It has remained on that curve, which I think has all these very profound implications for all of these domains. But, certainly, it validates and makes one feel very good about making a very big investment in in the things that that will play out if that if you stay on that track, and it seems like we are. So that, I think, is very good news.
Speaker 154:30 - 54:51
它一直保持在那条曲线上,我认为这对所有这些领域都有非常深远的影响。但毫无疑问,这验证了这样一种判断:如果你持续沿着这条轨道前进,而现在看起来我们确实如此,那么对那些将会随之展开的事情进行非常大的投资,会让人感觉非常踏实、非常好。所以我认为,这确实是非常好的消息。
Speaker 554:51 - 55:02
I think the most important aspect of what you're doing there is you're actually closing the loop with the actual biology. Mhmm. Because with code and research, it's closed loop systems. And so they're very fast to iterate. This is an open loop system, so you're closing a loop.
Speaker 554:51 - 55:02
我认为,你们在那里所做事情中最重要的一点,是你们实际上正在和真实的 biology(生物学)闭环。嗯。因为 code 和 research 是闭环系统,所以它们的迭代速度非常快。而这是一个开环系统,所以你们是在把这个环闭合起来。
Speaker 555:02 - 55:04
And that's that's really crucial to progress.
Speaker 555:02 - 55:04
而这对取得进展确实至关重要。
Speaker 255:04 - 55:42
Yeah. For me, one of the biggest changes with the strategy we're driving now, and Alex at the helm is, you know, before we had amazing teams moving generally in the same direction and understanding, like, the potential collaborations and interconnectedness of our work. But now we are arms linked moving together, which It's is very very directed. And it's very exciting. It's a little bit scary, but it's like truly a team playing off each other and trying to make progress towards this goal.
Speaker 255:04 - 55:42
对。对我来说,我们现在推进的这套 strategy(战略)以及由 Alex 掌舵所带来的最大变化之一是,你知道,以前我们有非常出色的 teams(团队),大家大体上朝着同一个方向前进,也理解比如说我们工作的潜在协作关系和相互联结性。但现在,我们是挽着手一起前进,这种方式非常、非常有方向性,也非常令人兴奋。它有一点吓人,但这真的就像一支团队在彼此配合,努力朝着这个目标取得进展。
Speaker 255:42 - 55:54
And that has taken a lot of work, but also the maturity, our teams being able to have their work at a level of maturation where it actually does make sense to interlock.
Speaker 255:42 - 55:54
这花了很多功夫,但同时也离不开一种成熟度——我们的 teams(团队)需要把各自的工作推进到足够成熟的阶段,这样彼此咬合、联动起来才真正有意义。
Speaker 455:54 - 55:57
Amazing. Well, to teams being on the curve, thank you guys for doing this.
Speaker 455:54 - 55:57
太棒了。好吧,祝各个团队都能继续保持在这条曲线上,也谢谢你们来参与这次交流。
Speaker 155:57 - 55:58
Thank you for joining us. Yeah.
Speaker 155:57 - 55:58
谢谢你们加入我们。对。
Speaker 255:58 - 55:59
Thank you.
Speaker 255:58 - 55:59
谢谢。
Speaker 355:59 - 55:59
Thank you.
Speaker 355:59 - 55:59
谢谢。
Speaker 456:02 - 56:18
Find us on Twitter nopriarspod. Subscribe to our YouTube channel if you wanna see our faces. Follow the show on Apple Podcasts, Spotify, or wherever you listen. That way you get a new episode every week. And sign up for emails or find transcripts for every episode at nopriors.com.
Speaker 456:02 - 56:18
在 Twitter 上关注我们:nopriarspod。如果你想看看我们的样子,欢迎订阅我们的 YouTube channel(频道)。也请在 Apple Podcasts、Spotify 或你收听节目的任何平台上关注本节目。这样你每周都会收到一集新节目。你也可以在 nopriors.com 注册邮件,或查看每一期节目的文字稿。
原文 ↗https://www.youtube.com/@NoPriorsPodcast
BuildSpeak — 关于本项目BUILT IN PUBLIC · 跟随 builders 而非 influencers