Skip to content

v0.2.0 - beta

Latest
Compare
Choose a tag to compare
@hkonsti hkonsti released this 12 Jul 00:03
· 7 commits to dev since this release
c6aa2b6

New version!! 😎

Thanks to everyone that tried out Haven since we released our first version! We've implemented your feedback, added some new features and fixed a bunch of bugs!

This is what changed:

  • We added support for mpt-30!
  • You can now add any fine-tuned model to Haven if we support the base-model. Here is a list of all supported base-models which includes llama-7b, llama-13b, mpt-7b, mpt-30b and more. How to add a new model.
  • We now support T4 GPUs which are much cheaper than the A100s
  • We added a pure text-completion endpoint (before we only had chat-completion), so you can now run non-chat models and have finer control over the prompt format.
  • We also fixed a bunch of problems that people ran into and made the setup a little bit easier.
  • If you’re only interested in the orchestration and not in our LLM code you can add your own worker docker-image. This is still super experimental so shoot me a message if you want to do this.

Where to find our docs: https://docs.haven.run
Talk to us on discord: https://discord.com/invite/JDjbfp6q2G