
Query server for model availability of OpenAI-compatible servers #609

Closed · surak opened this issue Nov 7, 2023 · 8 comments
Labels: kind:enhancement (Indicates a new feature request, improvement, or extension), stale

Comments

surak commented Nov 7, 2023

Validations

  • I believe this is a way to improve. I'll try to join the Continue Discord for questions
  • I'm not able to find an open issue that requests the same enhancement

Problem

OpenAI-compatible inference servers expose the /v1/models endpoint, which lists their models and capabilities, e.g.:

curl --header "Authorization: Bearer MY_OPENAI_KEY"   https://api.openai.com/v1/models

This returns a JSON list of the models and some of their capabilities; compatible servers implement the same endpoint.
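
An abridged response looks roughly like this (values are placeholders; the exact set of fields varies by server):

    {
      "object": "list",
      "data": [
        { "id": "gpt-4", "object": "model", "owned_by": "openai" },
        { "id": "gpt-3.5-turbo", "object": "model", "owned_by": "openai" }
      ]
    }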

This would make configuring such servers much easier: instead of writing config.py by hand, one could simply query the server for its models.
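
As a rough sketch of what a client-side probe could look like (plain TypeScript using the built-in fetch; the function and parameter names here are made up for illustration and are not Continue's actual API):

    // Query an OpenAI-compatible server for the models it serves.
    // `apiBase` is e.g. "https://api.openai.com/v1"; `apiKey` is optional.
    async function listAvailableModels(apiBase: string, apiKey?: string): Promise<string[]> {
      const response = await fetch(`${apiBase.replace(/\/+$/, "")}/models`, {
        headers: apiKey ? { Authorization: `Bearer ${apiKey}` } : {},
      });
      if (!response.ok) {
        throw new Error(`GET /models failed: ${response.status} ${response.statusText}`);
      }
      // OpenAI-compatible servers answer with { object: "list", data: [{ id, ... }] }.
      const body = (await response.json()) as { data: { id: string }[] };
      return body.data.map((model) => model.id);
    }

    // Example: fill the model dropdown instead of hand-writing the config:
    // listAvailableModels("https://api.openai.com/v1", process.env.OPENAI_API_KEY)
    //   .then((ids) => console.log(ids));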

Solution

No response

sestinj (Contributor) commented Nov 7, 2023

Just to be sure: the use case here is to populate the model dropdown when you're using the OpenAI class with a personal API token?

surak (Author) commented Nov 7, 2023

That, or any OpenAI-compatible model server, where the models can have completely different names from OpenAI's.

But yes, the autopopulate part is correct.

simon376 (Contributor) commented Nov 9, 2023

I tried using Continue with a small team, and it was horrendous to set up, especially since we had to switch models a few times. A way to set things up automatically would be great.

For example, when using TGI there's an /info endpoint that gives the model name, context window, and more. This could be used to configure the prompt template, special tokens, and so on automatically, with the user only selecting TGI and entering the URL.
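
The /info response looks roughly like this (abridged; the exact field names depend on the TGI version, and the host is a placeholder):

    curl https://my-tgi-host:8080/info

    {
      "model_id": "mistralai/Mistral-7B-Instruct-v0.1",
      "max_input_length": 4096,
      "max_total_tokens": 8192,
      "version": "1.4.0"
    }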

If I can help, let me know.

sestinj (Contributor) commented Nov 9, 2023

@simon376 Thanks for the honesty here. Model setup is something we're planning to spend most of next week on, so if you don't mind, I'd actually love to hear about the entirety of your experience. Is GitHub issues the best way to communicate, Discord, or even a quick call? Whatever is most comfortable. I'll have many questions and can hopefully share some early ideas with you.

Our goal, like you said, is to get this to a point where you wouldn't need to worry about the config file at all.

simon376 (Contributor) commented

I will message you on Discord when I find the time. My choice of words was a bit harsh, tbh 😅. Maybe I was just a bit ill-prepared.

sestinj (Contributor) commented Nov 16, 2023

No worries! "horrendous" was properly descriptive in my opinion : )

Many of the noted improvements are now available in the pre-release. We have yet to autodetect available models, but I still think this is a good idea.

surak (Author) commented Mar 8, 2024

I am using the pre-release version v.09.80 now.

At some point I actually got it running! Wow!

Now I can't get it to work. I am writing a guide for my users and tried going step by step, but I couldn't get it to probe the server.

Some things I noticed when trying it:

  • The "autodetect" option defaults to localhost:8000 and opens the config.json - It could be a step where you actually add the URL of the inference server (this is hidden on the advanced part).
  • There's no parameter to add an api key. I would argue that if the server probed beforehand returns a negative talking about an api key, a further field should open. ONLY THEN present the user with the list of models from said server.
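
As a rough sketch of the probe-then-ask flow I mean (illustrative TypeScript only; the names are made up and this is not Continue's actual code):

    // Probe the server without a key first; only ask the user for an API key
    // if the server rejects the request, and only then show the model list.
    async function probeForModels(apiBase: string, apiKey?: string) {
      const response = await fetch(`${apiBase.replace(/\/+$/, "")}/models`, {
        headers: apiKey ? { Authorization: `Bearer ${apiKey}` } : {},
      });
      if (response.status === 401 || response.status === 403) {
        // Credentials required: the UI should now reveal an API key field.
        return { needsApiKey: true, models: [] as string[] };
      }
      if (!response.ok) {
        throw new Error(`Probe failed with status ${response.status}`);
      }
      const body = (await response.json()) as { data: { id: string }[] };
      return { needsApiKey: false, models: body.data.map((m) => m.id) };
    }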

If I add a model with the name AUTODETECT and no title, like this:

    {
      "model": "AUTODETECT",
      "apiKey": "HERE_GOES_MY_TOKEN",
      "apiBase": "https://helmholtz-blablador.fz-juelich.de:8000/v1",
      "completionOptions": {},
      "provider": "openai"
    }

I end up with a model named "custom" in the model list, which points to... I don't know what. Its responses don't match my models.
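
For comparison, here is my guess at how the entry would sit inside the full "models" array with an explicit "title" (the title value is made up, and I haven't confirmed that the missing title is what produces the "custom" name):

    {
      "models": [
        {
          "title": "Blablador (autodetect)",
          "model": "AUTODETECT",
          "apiKey": "HERE_GOES_MY_TOKEN",
          "apiBase": "https://helmholtz-blablador.fz-juelich.de:8000/v1",
          "completionOptions": {},
          "provider": "openai"
        }
      ]
    }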

dosubot added the kind:enhancement (Indicates a new feature request, improvement, or extension) label and removed the enhancement label on Jul 8, 2024
github-actions bot added the stale label on Mar 3, 2025

This issue was closed because it wasn't updated for 10 days after being marked stale. If it's still important, please reopen + comment and we'll gladly take another look!

github-actions bot closed this as not planned (won't fix, can't repro, duplicate, stale) on Mar 15, 2025