IndexError: index 2002 is out of bounds for dimension 0 with size 768 #11

Dheeraj-kkde · 2024-01-31T16:56:37Z

Initially I was getting the following error:

OSError: Can't load tokenizer for 'bert-base-uncased'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure 'bert-base-uncased' is the correct path to a directory containing all relevant files for a BertTokenizer tokenizer.

The above error is the same for all the LM models mentioned in the README.md file.

I resolved that by following this post: https://stackoverflow.com/questions/69286889/transformers-and-bert-downloading-to-your-local-machine

After that, I started getting the error below:

PS C:\Users\dheerajkumar11\AI-CoE\Trusted AI-Fairness Metrics\crows-pairs-master> python metric.py --input_file data/crows_pairs_anonymized.csv --lm_model bert --output_file
ERROR:
Traceback (most recent call last):
File "C:\Users\dheerajkumar11\AI-CoE\Trusted AI-Fairness Metrics\crows-pairs-master\metric.py", line 296, in
evaluate(args)
File "C:\Users\dheerajkumar11\AI-CoE\Trusted AI-Fairness Metrics\crows-pairs-master\metric.py", line 234, in evaluate
score = mask_unigram(data, lm)
File "C:\Users\dheerajkumar11\AI-CoE\Trusted AI-Fairness Metrics\crows-pairs-master\metric.py", line 149, in mask_unigram
score1 = get_log_prob_unigram(sent1_masked_token_ids, sent1_token_ids, template1[i], lm)
File "C:\Users\dheerajkumar11\AI-CoE\Trusted AI-Fairness Metrics\crows-pairs-master\metric.py", line 74, in get_log_prob_unigram
log_probs = log_softmax(hs)[target_id]
IndexError: index 2002 is out of bounds for dimension 0 with size 768

My findings:

In the metric.py file,
hs = hidden_states[mask_idx]
target_id = token_ids[0][mask_idx]
log_probs = log_softmax(hs)[target_id]

Here, hs.size() returns torch.Size([768]), so hs has only 768 entries and cannot be indexed with a target_id of 2002.
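
To make the mismatch concrete, here is a minimal plain-Python sketch (no torch, so it runs anywhere). It assumes, without confirmation from the repo, that the lookup expects a vocabulary-sized score vector (30522 entries for bert-base-uncased) rather than a hidden-state vector of size 768. That situation could arise if the loaded model returns encoder hidden states instead of masked-LM logits. The helper name log_prob_of_target is hypothetical and is not part of metric.py.

```python
import math

HIDDEN_SIZE = 768    # size reported by hs.size() in this issue
VOCAB_SIZE = 30522   # vocabulary size of bert-base-uncased
TARGET_ID = 2002     # token id taken from the IndexError message


def log_softmax(scores):
    """Numerically stable log-softmax over a flat list of floats."""
    peak = max(scores)
    log_total = peak + math.log(sum(math.exp(s - peak) for s in scores))
    return [s - log_total for s in scores]


def log_prob_of_target(scores, target_id):
    """Hypothetical helper: look up a token's log-probability with a size check."""
    if not 0 <= target_id < len(scores):
        raise IndexError(
            f"target id {target_id} needs vocabulary-sized scores, "
            f"but the vector has only {len(scores)} entries"
        )
    return log_softmax(scores)[target_id]


# A hidden-state-sized vector reproduces the failure from the traceback.
hidden_like = [0.0] * HIDDEN_SIZE
try:
    log_prob_of_target(hidden_like, TARGET_ID)
    failed = False
except IndexError:
    failed = True

# A vocabulary-sized vector makes the same lookup succeed.
logits_like = [0.0] * VOCAB_SIZE
uniform_log_prob = log_prob_of_target(logits_like, TARGET_ID)
```

If this assumption holds, the thing to check would be whether the object passed as lm["model"] produces vocabulary-sized logits at output[0]. I have not verified this against the repo.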

It seems to be breaking in the LM logic. Could someone please look into this error? It fails even on the sample CSV file provided in the repo.

Thanks and regards,
Dheeraj Kumar
