What about URL-defined Ollama? Personally, I run open-webui on an outward-facing Pi on its own VLAN that connects to an internal machine running Ollama. That way there's a fallback to the OpenAI API if the internal machine is down.
Yeah, add something like this to your VS Code settings to point Cody at a different Ollama URL (here it's localhost:11434, but change apiEndpoint, model, and tokens to whatever you need).
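A rough sketch from memory (the setting and field names may differ slightly between Cody versions, and the model name/endpoint are just examples):

    // settings.json — field names from memory; adjust model/endpoint to your setup
    "cody.dev.models": [
      {
        "provider": "ollama",
        "model": "llama3.1",
        "apiEndpoint": "http://localhost:11434",
        "inputTokens": 8192,
        "outputTokens": 4096
      }
    ]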
We should add an easier way to just change the Ollama URL from localhost, so you can see all of your Ollama models listed just as you can when Ollama is available on localhost. Added to our TODO list!
When I tried Cody around half a year ago, it only used Ollama for tab completion while chat still used proprietary APIs (or the other way around). Has that changed by now, so you can prevent any API calls to third parties in the Cody config?
Yes, Cody can use Ollama for both chat and autocomplete. See https://sourcegraph.com/docs/cody/clients/install-vscode#sup.... This lets you use Cody fully offline, but it doesn't /prevent/ API calls to third parties; you are still able to select online models like Claude 3.5 Sonnet.
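For autocomplete specifically, the experimental Ollama provider looks roughly like this in settings.json (these are the experimental setting names as I remember them and may change between releases; the model is just an example):

    // experimental settings — names may change between Cody releases
    "cody.autocomplete.advanced.provider": "experimental-ollama",
    "cody.autocomplete.experimental.ollamaOptions": {
      "url": "http://localhost:11434",
      "model": "deepseek-coder:6.7b"
    }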
I have a WIP PR right now (like literally coding on it right now) making Cody support strict offline mode better (i.e., not even showing online models if you choose to be offline): https://github.com/sourcegraph/cody/pull/5221.