Skip to content

PatristicTextArchive/pta_lexika

Repository files navigation

Word-Lemma-Lists and Lexica used by PTA

The folder sources contains the source files of the used lexica; for more information see the README.md in this folder.

The folder scripts contains the Python scripts used to generate all files.

Greek

The repository has the following files at the moment:

  • Bailly2020.4a.json, LSJ.json, Pape.json, TBESG.json Greek dictionaries compiled from respective source files in sources.
  • pta_lexicon_grc.json:
    • compiled from LSJ, TBESG, and Pape
    • has: lemma – grc_eng – grc_eng2 – grc_deu
    • grc_eng = LSJ, grc_eng2 = TBESG, grc_deu = Pape
    • grc_eng and grc_deu are lists, as there are homonymous lemmata.
    • If there is no entry in one of the dictionaries, the entry is empty.
  • The folder pta_lexicon_grc contains xml-version of the above
  • wordlemma_grc_cltk.json:
  • wordlemma_grc.json (outdated):
  • wordlemma_grc.xml (outdated):
    • xml-version of the file above
  • wordlemma_grc_diogenes.json:
    • morphology data from Diogenes; Greek is converted to utf-8 (from Betacode).
    • has word - lemma (list of possible morphology)
  • JSON-versions of the lexica in the source-folder, adapted for use in PTA, the folder pta_dictionaries contains xml-versions of these.

Latin

Releases

No releases published

Packages

No packages published