resize embedding after add_special_tokens #56

Switchsyj · 2024-09-03T03:20:16Z

Hi, thanks for your great work! I would like to point out a potential bug in this code:
Calling add_special_tokens without checking the embedding size is dangerous, especially for LLaMA. In fact, LLaMA uses <end_of_text> as both the eos and bos token during training, so no new tokens should be needed. If you do add tokens, you need to resize the embedding after add_special_tokens; otherwise the new token ids fall outside the embedding matrix and torch.gather goes out of bounds.

Code line:

RRHF/train.py

Line 302 in e1a2b61

if "llama" in model_args.model_name_or_path:
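A minimal sketch of the kind of guard I mean, assuming `tokenizer` and `model` are Hugging Face transformers objects (the helper name `add_tokens_and_resize` is hypothetical and is not in train.py). It adds the tokens, compares the tokenizer length with the number of embedding rows, and resizes only when the vocabulary has outgrown the embedding:

```python
def add_tokens_and_resize(tokenizer, model, special_tokens):
    """Add special tokens, then grow the embedding matrix only if needed.

    Hypothetical helper. Assumes the Hugging Face interfaces:
    tokenizer.add_special_tokens(dict), len(tokenizer),
    model.get_input_embeddings() and model.resize_token_embeddings(int).
    """
    # Returns how many tokens were actually new to the vocabulary.
    num_added = tokenizer.add_special_tokens(special_tokens)

    vocab_size = len(tokenizer)
    embedding_rows = model.get_input_embeddings().weight.shape[0]

    # Resize only when some token id would index past the last embedding row.
    if vocab_size > embedding_rows:
        model.resize_token_embeddings(vocab_size)

    return num_added
```

It could be called right after the tokenizer is loaded, for example `add_tokens_and_resize(tokenizer, model, {"pad_token": "[PAD]"})`, where the pad token string is only a placeholder. Resizing only when the vocabulary is larger avoids shrinking checkpoints whose embedding matrix is padded beyond the tokenizer size.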
