Skip to content

Release v1.4.2 🚀

Compare
Choose a tag to compare
@kooyunmo kooyunmo released this 21 Jul 07:27
· 5 commits to main since this release
  • Support for Tool Calling API: Added new API to support tool calling.
  • Phi3 INT8 Support: Implemented support for Phi3 INT8.
  • Snowflake Arctic FP8 Quantizer: Introduced new quantizer for Snowflake Arctic FP8.
  • Added support for INT8 quantization for Llama and refactored quantizer to use only safetensors.