
llama-cpp-python raises an AttributeError on token_eos for Hermes-Pro-7B when using structured grammar generation #771

Closed
maxtheman opened this issue Mar 27, 2024 · 7 comments

Comments

@maxtheman

Describe the issue as clearly as possible:

When I try to use Hermes-Pro-7B with llama-cpp-python, I cannot use cfg to generate structured output from a grammar.

This is ONLY an issue with structured grammar generation via cfg; generate.json doesn't have this problem.
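
For contrast, here is roughly what the working JSON path looks like for me (a minimal sketch only; the schema below is an illustrative example, not what I actually used):

from outlines import generate
from outlines.models import llamacpp

# Illustrative JSON schema; any valid schema string should do
schema = """{
    "type": "object",
    "properties": {"expression": {"type": "string"}},
    "required": ["expression"]
}"""

model = llamacpp("./Hermes-2-Pro-Mistral-7B.Q8_0.gguf")
generator = generate.json(model, schema)
print(generator("Alice had 4 apples and Bob ate 2. Write an expression for Alice's apples:"))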

Steps/code to reproduce the bug:

from outlines.generate import cfg
from outlines.grammars import json as json_lark
from outlines.models import llamacpp

MODEL_PATH = "./Hermes-2-Pro-Mistral-7B.Q8_0.gguf"  # local GGUF file
model = llamacpp(MODEL_PATH)
# fails on the line below
generator = cfg(model, json_lark)
sequence = generator("Alice had 4 apples and Bob ate 2. Write an expression for Alice's apples:")
print(sequence)

Expected result:

I'd expect the AttributeError not to occur; instead, the sequence should print to stdout.

Error message:

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
Cell In[23], line 22
     21 # import pdb; pdb.set_trace()
---> 22 generator = cfg(model, json_lark)
     23 # generator = generate_json(model, Character)
     24 # generator = extract_json(model)
     25 sequence = generator("Alice had 4 apples and Bob ate 2. Write an expression for Alice's apples:")

File .venv/lib/python3.11/site-packages/outlines/generate/cfg.py, line 51
---> 51 logits_processor = CFGLogitsProcessor(cfg_str, model.tokenizer)
     52 generator = LlamaSequenceGenerator(logits_processor, model)
     54 return generator
...
File .venv/lib/python3.11/site-packages/outlines/integrations/llamacpp.py, line 46
---> 46 self.eos_token_id = model.token_eos()
     47 self.pad_token_id = self.eos_token_id
     48 self.special_tokens: Set[int] = set()

AttributeError: 'LlamaCppTokenizer' object has no attribute 'token_eos'
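
Reading the traceback, my best guess (I may be misreading the internals) is that cfg.py hands the already-wrapped LlamaCppTokenizer to CFGLogitsProcessor, whose __init__ then calls token_eos() on it as if it were the raw llama_cpp.Llama. A minimal check of the raw API, assuming the same GGUF file:

from llama_cpp import Llama

raw = Llama(model_path="./Hermes-2-Pro-Mistral-7B.Q8_0.gguf")

# The raw llama-cpp-python model does expose token_eos():
print(raw.token_eos())

# Per the traceback, CFGLogitsProcessor calls model.token_eos() on whatever it is
# given; here it receives outlines' LlamaCppTokenizer wrapper rather than the raw
# Llama, and the wrapper has no such method, hence the AttributeError.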


### Outlines/Python version information:

Version information
outlines version 0.0.37
Python 3.11.3
Managed by rye

### Context for the issue:

Hermes 2 Pro is Nous Research's newest and best model, and I suspect it's quite good at JSON schema creation because of its fine-tuning on tool calling.

I'd like to experiment with that to see if it's the case.
maxtheman added the bug label Mar 27, 2024
@sharanry

sharanry commented Mar 29, 2024

Facing the same issue with Mistral Instruct:

from llama_cpp import Llama
from outlines import models, generate

arithmetic_grammar = """
    ?start: expression

    ?expression: term (("+" | "-") term)*

    ?term: factor (("*" | "/") factor)*

    ?factor: NUMBER
           | "-" factor
           | "(" expression ")"

    %import common.NUMBER
"""


llm = Llama.from_pretrained(
    repo_id="TheBloke/Mistral-7B-Instruct-v0.1-GGUF",
    filename="mistral-7b-instruct-v0.1.Q4_K_S.gguf",
    verbose=True
)
model = models.LlamaCpp(llm)

generator = generate.cfg(model, arithmetic_grammar)

sequence = generator(
  "Alice had 4 apples and Bob ate 2. "
  + "Write an expression for Alice's apples:"
)

@rlouf
Member

rlouf commented Mar 30, 2024

I am so sorry this is happening! I will investigate early next week.

@maxtheman
Author

@rlouf I was just trying to use outlines again, this time with phi-3, still no luck, but I did make some progress.

It seems like the issue is that LlamaTokenizer in llama-cpp-python is fundamentally a different object from what SequenceGenerator expects.

You can patch on the attributes needed with something like the following:

from llama_cpp import Llama
from llama_cpp.llama_tokenizer import LlamaTokenizer


class HackedLlamaTokenizer(LlamaTokenizer):
    # Bolt the eos_token_id attribute outlines expects onto llama-cpp-python's tokenizer
    def __init__(self, llama: Llama, eos_token_id: int):
        self._model = llama._model
        self.eos_token_id = eos_token_id


if __name__ == "__main__":
    model = Llama(model_path="./phi-3-4k/Phi-3-mini-4k-instruct-q4.gguf")
    # _token_eos is what I found on the Llama object in the version I have installed
    model_tokenizer = HackedLlamaTokenizer(model, model._token_eos)
    test_str = "tresasdfasdf"
    print(model_tokenizer.encode(test_str))
    model.device = 'mps'
    # Patch the tokenizer attribute with the instantiated tokenizer object
    model.tokenizer = model_tokenizer
...

This doesn't work because the tokenizer's encode expects a str, but outlines passes it a list of prompts. So I patched that too:

    def encode(
        self, text: List[str], add_bos: bool = True, special: bool = True
    ) -> List[int]:
        # outlines hands over a list of prompts, so take the first one
        print(text)
        return self.tokenize(
            text[0].encode("utf-8", errors="ignore"), add_bos=add_bos, special=special
        )

But then I run into an error at line 176 of api.py in SequenceGenerator:

prompt_token_ids, attention_masks = self.tokenizer.encode(prompts)

because the LlamaTokenizer doesn't return attention masks.

At this point I'm out of my depth. I don't quite understand what attention masks are, or why you would want to ignore tokens. Why wouldn't this tokenizer return one if you're expecting it?
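
From what I can gather (and I may well be wrong), the mask just marks which positions are real tokens versus padding when prompts of different lengths are batched together, so for a single unpadded prompt nothing actually gets ignored. A toy sketch of what I mean, not outlines' actual code:

prompt_token_ids = [[1, 5, 7], [1, 9]]            # made-up token ids for two prompts
max_len = max(len(ids) for ids in prompt_token_ids)

padded, masks = [], []
for ids in prompt_token_ids:
    pad = max_len - len(ids)
    padded.append(ids + [0] * pad)                # 0 standing in for a pad token id
    masks.append([1] * len(ids) + [0] * pad)      # 1 = real token, 0 = padding

print(padded)  # [[1, 5, 7], [1, 9, 0]]
print(masks)   # [[1, 1, 1], [1, 1, 0]]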

But, I decided to try to just say 'let's not ignore any of them' and patched it in:

    def tokenize(
        self, text: bytes, add_bos: bool = True, special: bool = True
    ) -> Tuple[List[int], np.ndarray]:
        tokens = self._model.tokenize(text, add_bos=add_bos, special=special)
        ones_array = np.ones_like(tokens)  # mark every token as "real" (requires numpy as np)
        return tokens, ones_array

Which resulted in this error:

  File ".venv/lib/python3.12/site-packages/outlines/generate/api.py", line 177, in __call__
    prompt_token_ids = prompt_token_ids.to(self.device)
                       ^^^^^^^^^^^^^^^^^^^
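
If I'm reading api.py right, it then calls .to(self.device) on whatever encode returns, so it seems to expect torch tensors rather than plain lists or NumPy arrays. A hedged sketch of what my patched tokenize would presumably need to return instead (same self._model as above; this is my guess, not a confirmed fix):

import torch

def tokenize(self, text: bytes, add_bos: bool = True, special: bool = True):
    token_ids = self._model.tokenize(text, add_bos=add_bos, special=special)
    token_ids = torch.tensor([token_ids], dtype=torch.long)  # shape (1, seq_len)
    attention_mask = torch.ones_like(token_ids)              # single prompt, no padding
    return token_ids, attention_mask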

I'm pretty sure there's something broken with the llama-cpp integration. Even this, from the examples:

# curl -L -o mistral-7b-instruct-v0.2.Q5_K_M.gguf https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGUF/resolve/main/mistral-7b-instruct-v0.2.Q5_K_M.gguf
model = outlines.models.llamacpp("./mistral-7b-instruct-v0.2.Q5_K_M.gguf")

throws an error, since you need to create the Llama object manually, and fixing that leads again to the error:

AttributeError: 'function' object has no attribute 'eos_token_id'

So something has probably changed in llama-cpp-python since the integration was created, which broke this.

Thanks for taking a look!

lapp0 pushed a commit to lapp0/outlines that referenced this issue May 16, 2024
@lapp0
Collaborator

lapp0 commented May 16, 2024

@maxtheman I couldn't reproduce the error in your script. Is it possible aacc633 fixed it? Please let me know if the issue still occurs on the latest version of outlines.

@rlouf
Member

rlouf commented May 19, 2024

Yes, it would have fixed it. Please update outlines to use the latest version and give it another try.
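
Something like this confirms which version is installed (assuming a standard pip-managed environment; the fix should be in any release newer than the 0.0.37 reported above):

from importlib.metadata import version
print(version("outlines"))  # should be newer than 0.0.37 to pick up the fix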

@maxtheman
Author

maxtheman commented May 19, 2024 via email

@rlouf
Member

rlouf commented May 23, 2024

Appears to be solved. Please reopen if that’s not the case.

@rlouf rlouf closed this as completed May 23, 2024