Commit 3703754

authored
bug fix (#3252)
1 parent 0ae4560 commit 3703754

1 file changed: 3 additions, 3 deletions

xpu/2.3.110+xpu/tutorials/api_doc.html

Lines changed: 3 additions & 3 deletions
@@ -338,15 +338,15 @@ <h2>General<a class="headerlink" href="#general" title="Link to this heading">
 </dl>
 <div class="admonition warning">
 <p class="admonition-title">Warning</p>
-<p>Please invoke <code class="docutils literal notranslate"><span class="pre">optimize_transformers</span></code> function AFTER invoking DeepSpeed in Tensor Parallel
+<p>Please invoke <code class="docutils literal notranslate"><span class="pre">ipex.llm.optimize</span></code> function AFTER invoking DeepSpeed in Tensor Parallel
 inference scenario.</p>
 </div>
 <p class="rubric">Examples</p>
 <div class="doctest highlight-default notranslate"><div class="highlight"><pre><span></span><span class="gp">&gt;&gt;&gt; </span><span class="c1"># bfloat16 generation inference case.</span>
 <span class="gp">&gt;&gt;&gt; </span><span class="n">model</span> <span class="o">=</span> <span class="o">...</span>
 <span class="gp">&gt;&gt;&gt; </span><span class="n">model</span><span class="o">.</span><span class="n">load_state_dict</span><span class="p">(</span><span class="n">torch</span><span class="o">.</span><span class="n">load</span><span class="p">(</span><span class="n">PATH</span><span class="p">))</span>
 <span class="gp">&gt;&gt;&gt; </span><span class="n">model</span><span class="o">.</span><span class="n">eval</span><span class="p">()</span>
-<span class="gp">&gt;&gt;&gt; </span><span class="n">optimized_model</span> <span class="o">=</span> <span class="n">ipex</span><span class="o">.</span><span class="n">optimize_transformers</span><span class="p">(</span><span class="n">model</span><span class="p">,</span> <span class="n">dtype</span><span class="o">=</span><span class="n">torch</span><span class="o">.</span><span class="n">bfloat16</span><span class="p">)</span>
+<span class="gp">&gt;&gt;&gt; </span><span class="n">optimized_model</span> <span class="o">=</span> <span class="n">ipex</span><span class="o">.</span><span class="n">llm</span><span class="o">.</span><span class="n">optimize</span><span class="p">(</span><span class="n">model</span><span class="p">,</span> <span class="n">dtype</span><span class="o">=</span><span class="n">torch</span><span class="o">.</span><span class="n">bfloat16</span><span class="p">)</span>
 <span class="gp">&gt;&gt;&gt; </span><span class="n">optimized_model</span><span class="o">.</span><span class="n">generate</span><span class="p">()</span>
 </pre></div>
 </div>
@@ -789,4 +789,4 @@ <h2>C++ API<a class="headerlink" href="#c-api" title="Link to this heading"><
 </script>
 
 </body>
-</html>
+</html>
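
The documentation change above swaps the name `ipex.optimize_transformers` for `ipex.llm.optimize`. For code that may run against installations exposing either name, one option is to resolve the entry point at runtime. The following is a minimal sketch, not taken from this commit: the helper `resolve_llm_optimize` is hypothetical, the stand-in namespaces replace the real `intel_extension_for_pytorch` module so the snippet runs without it, and it assumes both names accept the same `(model, dtype=...)` call shape shown in the doc example.

```python
from types import SimpleNamespace


def resolve_llm_optimize(ipex_module):
    """Return the LLM optimization entry point exposed by ``ipex_module``.

    Hypothetical helper: prefers ``ipex.llm.optimize`` (the name the updated
    docs use) and falls back to the older ``ipex.optimize_transformers``.
    """
    llm = getattr(ipex_module, "llm", None)
    new_entry = getattr(llm, "optimize", None)
    if callable(new_entry):
        return new_entry
    old_entry = getattr(ipex_module, "optimize_transformers", None)
    if callable(old_entry):
        return old_entry
    raise AttributeError("no LLM optimize entry point found on the given module")


# Stand-in objects for illustration only; real code would pass the imported
# intel_extension_for_pytorch module instead.
new_style = SimpleNamespace(
    llm=SimpleNamespace(optimize=lambda model, dtype=None: ("new", model))
)
old_style = SimpleNamespace(
    optimize_transformers=lambda model, dtype=None: ("old", model)
)

optimize = resolve_llm_optimize(new_style)
result = optimize("model-placeholder")
```

In real use, per the warning in the diff, the resolved function would be called only after DeepSpeed has been invoked in a Tensor Parallel inference setup.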
