<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>AI on Chico's Tech Blog</title><link>https://realtime-ai.chat/tags/ai/</link><description>Recent content in AI on Chico's Tech Blog</description><image><title>Chico's Tech Blog</title><url>https://github.com/chicogong.png</url><link>https://github.com/chicogong.png</link></image><generator>Hugo</generator><language>zh-cn</language><lastBuildDate>Sat, 10 Jan 2026 14:00:00 +0800</lastBuildDate><atom:link href="https://realtime-ai.chat/tags/ai/index.xml" rel="self" type="application/rss+xml"/><item><title>声音克隆：60秒复制你的声音，然后呢？</title><link>https://realtime-ai.chat/posts/voice-cloning/</link><pubDate>Sat, 10 Jan 2026 14:00:00 +0800</pubDate><guid>https://realtime-ai.chat/posts/voice-cloning/</guid><description>声音克隆技术现状:60 秒复制一个人的声音有多容易,以及随之而来的诈骗风险与防范。</description><content:encoded><![CDATA[<h2 id="先说个真事">先说个真事</h2>
<p>朋友公司有人收到&quot;老板&quot;的语音消息，让转账50万。声音、语气都对，差点就转了。后来发现是AI克隆的——骗子从老板的抖音视频里扒了几十秒素材。</p>
<p>这就是现在声音克隆的水平：<strong>以假乱真</strong>。</p>
<hr>
<h2 id="60秒能干什么">60秒能干什么</h2>
<p>用ElevenLabs举例：</p>
<div class="highlight"><div class="chroma">
<table class="lntable"><tr><td class="lntd">
<pre tabindex="0" class="chroma"><code><span class="lnt"> 1
</span><span class="lnt"> 2
</span><span class="lnt"> 3
</span><span class="lnt"> 4
</span><span class="lnt"> 5
</span><span class="lnt"> 6
</span><span class="lnt"> 7
</span><span class="lnt"> 8
</span><span class="lnt"> 9
</span><span class="lnt">10
</span><span class="lnt">11
</span><span class="lnt">12
</span><span class="lnt">13
</span></code></pre></td>
<td class="lntd">
<pre tabindex="0" class="chroma"><code class="language-python" data-lang="python"><span class="line"><span class="cl"><span class="kn">from</span> <span class="nn">elevenlabs</span> <span class="kn">import</span> <span class="n">clone</span><span class="p">,</span> <span class="n">generate</span>
</span></span><span class="line"><span class="cl">
</span></span><span class="line"><span class="cl"><span class="c1"># 上传60秒录音</span>
</span></span><span class="line"><span class="cl"><span class="n">voice</span> <span class="o">=</span> <span class="n">clone</span><span class="p">(</span>
</span></span><span class="line"><span class="cl">    <span class="n">name</span><span class="o">=</span><span class="s2">&#34;我的声音&#34;</span><span class="p">,</span>
</span></span><span class="line"><span class="cl">    <span class="n">files</span><span class="o">=</span><span class="p">[</span><span class="s2">&#34;sample.mp3&#34;</span><span class="p">]</span>
</span></span><span class="line"><span class="cl"><span class="p">)</span>
</span></span><span class="line"><span class="cl">
</span></span><span class="line"><span class="cl"><span class="c1"># 让它说任何话</span>
</span></span><span class="line"><span class="cl"><span class="n">audio</span> <span class="o">=</span> <span class="n">generate</span><span class="p">(</span>
</span></span><span class="line"><span class="cl">    <span class="n">text</span><span class="o">=</span><span class="s2">&#34;这话我从没说过&#34;</span><span class="p">,</span>
</span></span><span class="line"><span class="cl">    <span class="n">voice</span><span class="o">=</span><span class="n">voice</span>
</span></span><span class="line"><span class="cl"><span class="p">)</span>
</span></span></code></pre></td></tr></table>
</div>
</div><p>就这么简单。效果好到专业人士都分辨不出。</p>
<hr>
<h2 id="能用来干什么">能用来干什么</h2>
<p><strong>正经用途：</strong></p>
<ul>
<li>有声书制作（成本从10万降到1千）</li>
<li>虚拟主播（24小时不下播）</li>
<li>游戏NPC配音（1000个NPC，1000种声音）</li>
<li>帮失声的人&quot;说话&quot;</li>
</ul>
<p><strong>不正经用途：</strong></p>
<ul>
<li>诈骗（前面说的那种）</li>
<li>伪造录音</li>
<li>未经授权用别人的声音</li>
</ul>
<hr>
<h2 id="怎么防骗">怎么防骗</h2>
<ol>
<li><strong>涉及转账，打电话确认</strong>。语音消息不算数。</li>
<li><strong>设暗号</strong>。家人之间约定一个只有你们知道的词。</li>
<li><strong>听细节</strong>。AI声音太&quot;完美&quot;——没有呼吸声、没有口水音、没有犹豫。</li>
</ol>
<hr>
<h2 id="怎么玩">怎么玩</h2>
<p><strong>免费方案：</strong> Coqui TTS（开源），需要自己部署</p>
<p><strong>付费方案：</strong> ElevenLabs，$11/月起，效果最好</p>
<p><strong>录音技巧：</strong></p>
<ul>
<li>安静环境</li>
<li>正常语速</li>
<li>至少60秒，内容越丰富越好</li>
</ul>
<hr>
<h2 id="配音演员会失业吗">配音演员会失业吗</h2>
<p>低端活会被抢：有声书旁白、广告配音、游戏NPC。</p>
<p>高端活抢不走：需要情感演绎的角色、艺术创作。</p>
<p><strong>新机会：</strong> 授权自己的声音收版权费、做AI配音指导。</p>
<hr>
<h2 id="最后">最后</h2>
<p>技术没有善恶，看人怎么用。</p>
<p>玩声音克隆记得：<strong>用自己的声音玩，别克隆别人的</strong>。</p>
<p>有问题留言。</p>
<hr>
<p><em>相关链接：<a href="https://elevenlabs.io/">ElevenLabs</a> | <a href="https://github.com/coqui-ai/TTS">Coqui TTS</a></em></p>
]]></content:encoded></item></channel></rss>