improve str and repr #187

eiennohito · 2021-12-02T05:47:42Z

Based on WorksApplications/SudachiPy#166
Fixes #122

There is a slight difference in the proposed format caused by WordId formatting, the implemented version uses (dic_id, word_id)

>>> d = sudachipy.Dictionary()
>>> tok = d.create(sudachipy.SplitMode.A)
>>> mrs = tok.tokenize("外国人参政権")
>>> mrs
<MorphemeList[
  <Morpheme(外国, 0:2, (0, 375175))>,
  <Morpheme(人, 2:3, (0, 284079))>,
  <Morpheme(参政, 3:5, (0, 331513))>,
  <Morpheme(権, 5:6, (0, 522170))>,
]>
>>> str(mrs)
'外国 人 参政 権'
>>> mrs[0]
<Morpheme(外国, 0:2, (0, 375175))>
>>> str(mrs[0])
'外国'

Remaining question:
Should strings be naked as they are now or should we put them into quotes? (<Morpheme('外国', 0:2, (0, 375175))>)

eiennohito added 3 commits December 2, 2021 14:42

improve __str__ and __repr__

fb21dae

add test for morpheme's str and repr

bee0ab7

mention changes in the changelog

be2ca4f

eiennohito requested a review from mh-northlander December 3, 2021 07:29

mh-northlander approved these changes Dec 7, 2021

View reviewed changes

eiennohito merged commit afe1a1e into WorksApplications:develop Dec 7, 2021

eiennohito deleted the 122-ergonomics branch December 7, 2021 07:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve str and repr #187

improve str and repr #187

eiennohito commented Dec 2, 2021 •

edited

Loading

improve __str__ and __repr__ #187

improve __str__ and __repr__ #187

Conversation

eiennohito commented Dec 2, 2021 • edited Loading

improve str and repr #187

improve str and repr #187

eiennohito commented Dec 2, 2021 •

edited

Loading