🐦 X · 动态Zara Zhang @zarazhangrui· 2026 年 5 月 9 日· 104 词 · 约 1 分钟

Zara Zhang · @zarazhangrui

SPACE 播放 / 暂停 上一句 / 下一句
Built a "YouTube realtime copilot" browser extension using OpenAI's realtime 2 API: The agent watches the video alongside you, and can answer any question you have about what was just said via realtime voice chat. The crazy part to me is: It can differentiate the YouTube's audio stream and your voice, so it doesn't confuse the video as commands, and stays silent unless you ask something!
我用 OpenAI 的 realtime 2 API 做了一个“YouTube realtime copilot”浏览器扩展:这个 agent 会和你一起“观看”视频,并且可以通过 realtime(实时)语音聊天,回答你关于刚刚说了什么的任何问题。对我来说最疯狂的是:它能区分 YouTube 的音频流和你的声音,所以不会把视频内容误当成指令,而且除非你主动提问,否则它会保持安静!
All 32 Beautiful HTML Slide Templates are now available on AnyGen, it's plug-and-play even for those without a coding agent Use them now:
现在,32 个精美的 HTML 幻灯片模板都已经上线 AnyGen 了;即使是没有 coding agent 的人也能即插即用。现在就用吧:
Can confirm GPT realtime 2 feels like black magic; unlocks so many new applications!! Building with it now
可以确认,GPT realtime 2 感觉就像黑魔法一样;它解锁了太多全新的应用场景!!我现在就在用它做开发
原文 ↗https://x.com/zarazhangrui