
Support for Gemma 2? #115

Open
sdmorrey opened this issue Aug 4, 2024 · 2 comments

Comments

sdmorrey commented Aug 4, 2024

What would be required to support Gemma 2?
I'd be happy to chip in and help with the code; I just need some insight into what would need to be changed.

b4rtaz (Owner) commented Aug 5, 2024

Hello @sdmorrey,

You should check the llama2-tasks.cpp and grok1-tasks.cpp files. Distributed Llama builds a different task list for each architecture. Tasks are reused, of course; in grok1-tasks.cpp you can see the implementation of tasks that differ from the ones the Llama model uses.

I see Gemma 2 has more norm layers. The RoPE layer seems to be implemented already (FalconRopeCommand). The tokenizer is probably what needs more work (the converter), but I'm not sure.
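To make the per-architecture task-list idea concrete, here is a minimal C++ sketch, not the actual Distributed Llama API: each architecture assembles its forward pass as an ordered list of reusable task functions, and supporting a new architecture mostly means recombining existing tasks. The extra norm placement for Gemma 2 is an assumption based on "more norm layers" above.

```cpp
#include <functional>
#include <string>
#include <vector>

// Shared state passed through every task; here it just records execution order.
struct Context { std::string trace; };
using Task = std::function<void(Context&)>;

// Reusable tasks shared across architectures (names are illustrative only).
void rmsNorm(Context& c)     { c.trace += "rmsNorm;"; }
void attention(Context& c)   { c.trace += "attention;"; }
void feedForward(Context& c) { c.trace += "ffn;"; }

// Llama-style layer: norm -> attention -> norm -> feed-forward.
std::vector<Task> buildLlamaTasks() {
    return { rmsNorm, attention, rmsNorm, feedForward };
}

// Hypothetical Gemma 2 layer: same reused tasks, with extra post-norms.
std::vector<Task> buildGemma2Tasks() {
    return { rmsNorm, attention, rmsNorm, rmsNorm, feedForward, rmsNorm };
}

// The runner is architecture-agnostic: it just executes whatever list it got.
std::string runTasks(const std::vector<Task>& tasks) {
    Context c;
    for (const auto& t : tasks) t(c);
    return c.trace;
}
```

Under this layout, adding Gemma 2 would mean writing a gemma2-tasks file that builds its own list, implementing only the tasks that do not already exist.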

unclemusclez commented:
+1 for Gemma 2
