[OpenAI]: Encoding Model #31402
Conversation
keenborder786 commented on May 28, 2025
- Description: Small fix to handle a `KeyError` when getting the encoder, and to use the correct encoder for newer models
- Issue: `self._get_encoding_model` of the `langchain_openai.BaseChatOpenAI` class returns an incorrect string #31390
model = "cl100k_base"
encoding = tiktoken.get_encoding(model)
encoder = "cl100k_base"
if self.model_name.startswith("gpt-4o"):
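For reference, a minimal sketch of what a `KeyError`-safe lookup could look like. This is an assumption about the shape of the fix, not the PR's actual code: the helper name `lookup_encoding` and the model-prefix list are illustrative, and only the public `tiktoken` API is used.

```python
import tiktoken


def lookup_encoding(model_name: str) -> tiktoken.Encoding:
    """Hypothetical helper sketching the KeyError fallback discussed in this PR.

    tiktoken.encoding_for_model raises KeyError for model names it does not
    know yet; in that case, fall back to a default encoder, using o200k_base
    for newer model families (the exact prefix list here is an assumption).
    """
    try:
        return tiktoken.encoding_for_model(model_name)
    except KeyError:
        if model_name.startswith(("gpt-4o", "gpt-4.1")):
            encoder = "o200k_base"
        else:
            encoder = "cl100k_base"
        return tiktoken.get_encoding(encoder)
```

On tiktoken versions that do not yet recognize a newer model name, the fallback branch would still select `o200k_base` for the gpt-4o/gpt-4.1 families and `cl100k_base` otherwise.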
Should this be gpt-4.1?
4.1 models use the o200k_base encoding:
https://huggingface.co/datasets/openai/mrcr#how-to-run
https://community.openai.com/t/whats-the-tokenization-algorithm-gpt-4-1-uses/1245758
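One way to sanity-check this locally is to ask tiktoken directly which encoding it maps each model name to; a small sketch (the output depends on the installed tiktoken version, which may not yet recognize gpt-4.1):

```python
import tiktoken

# Print the encoding tiktoken associates with each model name.
# Older tiktoken releases may raise KeyError for names they do not know yet.
for name in ("gpt-4", "gpt-4o", "gpt-4.1"):
    try:
        print(name, "->", tiktoken.encoding_for_model(name).name)
    except KeyError:
        print(name, "-> not recognized by this tiktoken version")
```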
okay thanks, will update
CodSpeed Walltime Performance Report: Merging #31402 will not alter performance.
CodSpeed Instrumentation Performance Report: Merging #31402 will not alter performance.
@ccurme can you please check?
@ccurme this is good to go.
@ccurme, please review and let me know if there is an issue.