
feat: Support response_format and structured JSON responses. #3785


Closed
1 task done
Tracked by #1151
actow opened this issue Jun 24, 2024 · 2 comments · Fixed by menloresearch/cortex.llamacpp#308 or menloresearch/cortex.cpp#1749
Labels
  • category: model settings (inference params, presets, templates)
  • category: providers (local & remote inference providers)
  • Cortex-related
  • type: feature request (a new feature)

Comments

@actow commented Jun 24, 2024

  • I have searched the existing issues

Is your feature request related to a problem? Please describe it

There is no way to force the model to return structured JSON at the API level.

Describe the solution

The response_format parameter is supported by several providers and models, such as Groq's Llama 3 (8B & 70B), Fireworks AI's Llama 3 70B, and OpenAI's GPT-3.5 and GPT-4.

https://console.groq.com/docs/api-reference#chat-create
https://platform.openai.com/docs/api-reference/audio/createSpeech#audio-createspeech-response_format
https://readme.fireworks.ai/docs/structured-response-formatting

It would be great if Jan provided a UI to set that.
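
For illustration, here is roughly what such a request looks like against an OpenAI-compatible endpoint (a minimal Python sketch; the local URL and model id are assumptions for illustration, not confirmed Jan defaults):

```python
import requests

# Hypothetical request against a local, OpenAI-compatible server.
# The endpoint URL and model id below are assumptions.
resp = requests.post(
    "http://localhost:1337/v1/chat/completions",
    json={
        "model": "llama3-8b-instruct",
        "messages": [
            {"role": "system", "content": "You are a helpful assistant. Reply in JSON."},
            {"role": "user", "content": "List the three primary colors."},
        ],
        # The parameter this issue asks Jan to support and expose in the UI:
        "response_format": {"type": "json_object"},
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```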

Teachability, documentation, adoption, migration strategy

See the API references linked above.

What is the motivation / use case for changing the behavior?

Structured JSON responses are very useful for experimenting with how far a model's capabilities extend beyond a normal chatbot.

@actow actow added the type: feature request A new feature label Jun 24, 2024
@imtuyethan imtuyethan self-assigned this Jul 2, 2024
@imtuyethan imtuyethan removed their assignment Aug 28, 2024
@imtuyethan imtuyethan transferred this issue from menloresearch/jan Sep 2, 2024
@freelerobot freelerobot added the category: app shell Installation, updater, global application issues label Sep 6, 2024
@dan-menlo (Contributor)

@nguyenhoangthuan99 I'm marking this for Sprint 20, but it's possible that this is not a simple upstream llama.cpp capability.

  • Can you take a look and assess?
  • If it's too big, we will postpone to Sprint 21.
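
For context, the upstream llama.cpp capability in question is grammar-constrained sampling on its server. A minimal sketch of exercising it directly (assuming a llama.cpp server on its default port 8080; the GBNF grammar below is a toy example, not Jan's or cortex.cpp's actual implementation):

```python
import requests

# Toy GBNF grammar constraining output to a one-field JSON object.
grammar = r'''
root   ::= "{" ws "\"name\"" ws ":" ws string ws "}"
string ::= "\"" [a-zA-Z0-9 ]* "\""
ws     ::= [ \t\n]*
'''

resp = requests.post(
    "http://localhost:8080/completion",  # assumed default llama.cpp server port
    json={
        "prompt": "Return the user's name as JSON. Name: Ada.\n",
        "grammar": grammar,   # constrains sampling to tokens the grammar allows
        "n_predict": 64,
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["content"])
```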

@dan-menlo (Contributor)

Also: linked to menloresearch/cortex.cpp#295?

@gabrielle-ong gabrielle-ong transferred this issue from menloresearch/cortex.cpp Oct 13, 2024
@gabrielle-ong gabrielle-ong added the category: providers Local & remote inference providers label Oct 13, 2024
@gabrielle-ong gabrielle-ong moved this from Icebox to Investigating in Menlo Oct 13, 2024
@freelerobot freelerobot added category: model settings Inference params, presets, templates and removed category: app shell Installation, updater, global application issues labels Oct 15, 2024
@freelerobot freelerobot moved this from Investigating to Planning in Menlo Oct 15, 2024
@nguyenhoangthuan99 nguyenhoangthuan99 moved this from Planning to In Progress in Menlo Nov 29, 2024
@imtuyethan imtuyethan moved this from In Progress to In Review in Menlo Nov 29, 2024
@github-project-automation github-project-automation bot moved this from In Review to Review + QA in Menlo Dec 1, 2024
@imtuyethan imtuyethan moved this from QA to Completed in Menlo Dec 3, 2024