Release v0.2.0 - beta · redotvideo/haven

New version!! 😎

Thanks to everyone that tried out Haven since we released our first version! We've implemented your feedback, added some new features and fixed a bunch of bugs!

This is what changed:

We added support for mpt-30!
You can now add any fine-tuned model to Haven if we support the base-model. Here is a list of all supported base-models which includes llama-7b, llama-13b, mpt-7b, mpt-30b and more. How to add a new model.
We now support T4 GPUs which are much cheaper than the A100s
We added a pure text-completion endpoint (before we only had chat-completion), so you can now run non-chat models and have finer control over the prompt format.
We also fixed a bunch of problems that people ran into and made the setup a little bit easier.
If you’re only interested in the orchestration and not in our LLM code you can add your own worker docker-image. This is still super experimental so shoot me a message if you want to do this.

Where to find our docs: https://docs.haven.run
Talk to us on discord: https://discord.com/invite/JDjbfp6q2G

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.2.0 - beta

New version!! 😎

This is what changed: