
feature: Support new OpenAI o1 reasoning models #575

Closed
abatilo opened this issue Sep 12, 2024 · 11 comments
Labels
enhancement New feature or request

Comments

@abatilo
Contributor

abatilo commented Sep 12, 2024

Feature request

I would like it if Avante could prompt against the OAI reasoning models.

Motivation

These reasoning models are allegedly more capable at coding.

Other

There are a few differences in the API now:

  1. temperature must be set to 1 if you're using either the o1-preview or o1-mini models.
  2. max_tokens is not used; max_completion_tokens must be used in the config instead.
  3. These models do not support a system role/message, so I think we need to drop the system message from parse_message.
  4. Streaming is not supported, so I believe we need to remove stream = true from parse_curl_args; a sketch of the resulting request body follows this list.
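
Putting those four changes together, the request body for o1 would look roughly like this sketch. It's illustrative only: the table shape follows what parse_curl_args builds, but the model name and the user_content value are placeholders, not avante's actual code.

local user_content = "Explain this function" -- placeholder prompt, not avante's real message building

local body = {
  model = "o1-preview",
  messages = {
    -- no { role = "system", ... } entry: o1 rejects system messages
    { role = "user", content = user_content },
  },
  temperature = 1,              -- must be exactly 1 for o1-preview / o1-mini
  max_completion_tokens = 4096, -- replaces max_tokens
  -- note: no stream = true, since streaming is not supported
}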

Even with these changes, I'm not getting a successful end-to-end flow. I've never contributed to the avante.nvim codebase, and I'm not entirely sure what else to try at the moment to get things working.

So far, my total diff looks like this:

diff --git a/lua/avante/config.lua b/lua/avante/config.lua
index c1689d7..82d3477 100644
--- a/lua/avante/config.lua
+++ b/lua/avante/config.lua
@@ -30,8 +30,8 @@ You are an excellent programming expert.
     endpoint = "https://api.openai.com/v1",
     model = "gpt-4o",
     timeout = 30000, -- Timeout in milliseconds
-    temperature = 0,
-    max_tokens = 4096,
+    temperature = 1,
+    max_completion_tokens = 4096,
     ["local"] = false,
   },
   ---@type AvanteSupportedProvider
diff --git a/lua/avante/providers/openai.lua b/lua/avante/providers/openai.lua
index 52e62b1..888d466 100644
--- a/lua/avante/providers/openai.lua
+++ b/lua/avante/providers/openai.lua
@@ -51,7 +51,6 @@ M.parse_message = function(opts)
   end

   return {
-    { role = "system", content = opts.system_prompt },
     { role = "user", content = user_content },
   }
 end
@@ -91,7 +90,6 @@ M.parse_curl_args = function(provider, code_opts)
     body = vim.tbl_deep_extend("force", {
       model = base.model,
       messages = M.parse_message(code_opts),
-      stream = true,
     }, body_opts),
   }
 end

I've gotten this far by trying the openai provider, watching it fail, and reading the error messages it returned. This time, though, it's not returning anything: I see Generating response ... and it never changes.

@abatilo abatilo added the enhancement New feature or request label Sep 12, 2024
@Alextibtab

I assume they'll open it up in the near future, but currently you need to be at usage tier 5 to use the reasoning models via the API (https://platform.openai.com/docs/guides/rate-limits/usage-tiers?context=tier-five), meaning you have to have bought $1000 in tokens.

@cfcosta

cfcosta commented Sep 13, 2024

@Alextibtab if you need to test anything, I have this level of API access.

@aarnphm
Collaborator

aarnphm commented Sep 13, 2024

lmao if you have tier 5 then feel free to use it. For now we have to wait till GA for API usage.

btw the API costs would be pretty high for o1 from my testing.

@oskarpyk

@abatilo I have access and just tried your diff; I'm experiencing the same freeze at Generating response... unfortunately. Presumably there's some deeper logic in Avante that conflicts with o1's non-streamed response mechanic?

@aarnphm
Collaborator

aarnphm commented Sep 25, 2024

I mean they haven't even published the o1 API yet, so there's nothing we can do for now.

I suspect it is still streaming; it's just that we need to figure out how to display the CoT reasoning.

@abatilo
Contributor Author

abatilo commented Sep 25, 2024

@aarnphm There might be a misunderstanding. There is an API but it's closed to certain tiers. It's mostly the same as the current chat completions endpoint, but it doesn't support streaming. According to OAI, they don't plan on returning the full CoT tokens, unless maybe they've changed their mind.

I think we would mostly just need to add a code path for handling non-streaming responses to make this work; something roughly like the sketch below.
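
A minimal sketch of that non-streaming path, assuming the provider hands the full response body to a completion callback once curl finishes (the function and callback names here are hypothetical, not avante's actual API):

local function parse_response_without_streaming(body, on_complete)
  -- Non-streaming responses arrive as one JSON document instead of SSE chunks.
  local ok, decoded = pcall(vim.json.decode, body)
  if not ok or not decoded.choices or not decoded.choices[1] then
    on_complete(nil, "unexpected response: " .. tostring(body))
    return
  end
  -- The whole answer lives in choices[1].message.content.
  on_complete(decoded.choices[1].message.content, nil)
end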

@aarnphm
Collaborator

aarnphm commented Sep 25, 2024

> There is an API but it's closed to certain tiers.

Yes, the API is open for tier 5 and up. What I'm referring to is that their API reference has yet to include examples for o1 or to specify the data type used for SSE during CoT.

I don't think they will ever publish the CoT tokens (that is apparently their moat). But that is irrelevant in this case.

The chat after the CoT is still streaming, afaict.

@LessComplexity
Contributor

I've added a pull request that adds o1 support without interfering with other models, and it also solves the response-hanging issue people experienced in this thread.
It's waiting for review and approval so that people with tier 5 API access can enjoy it too :)

Have an awesome day guys <3

@oskarpyk

Well done @LessComplexity! Fantastic work.

@LessComplexity
Contributor

@aarnphm
This issue can be closed, as my commit already adds o1 model support :)

@aarnphm
Collaborator

aarnphm commented Sep 28, 2024

thanks

@aarnphm aarnphm closed this as completed Sep 28, 2024