Add a Float generation method #154

Closed
rlouf opened this issue Jun 21, 2023 · 2 comments · Fixed by #175
Labels
enhancement · text (Linked to text generation)

Comments

@rlouf (Member) commented Jun 21, 2023

We can steer the generation process so that we only output floats. I propose to implement a `Float` subclass of `Sequence` that uses masking to restrict the generated tokens to valid floats. The mask is a function of the tokens that have already been generated: for instance, if the sequence generated so far contains a period, we can only generate digits afterwards. We will need to add a `create_proposal` method to `Sequence` that applies the mask to the logits generated by the model.
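As a rough illustration of the masking rule (a toy vocabulary and a hypothetical `float_mask` helper, not the actual `Sequence` API), something along these lines could work:

```python
import torch

def float_mask(generated: str, vocab: dict) -> torch.Tensor:
    """Boolean mask over the vocabulary given the text generated so far.

    Toy rule: once the partial sequence contains a period, only digit
    tokens remain allowed; before that, digit tokens and tokens that
    contain a single period are allowed.
    """
    seen_period = "." in generated
    mask = torch.zeros(len(vocab), dtype=torch.bool)
    for token_id, token_str in vocab.items():
        if seen_period:
            mask[token_id] = token_str.isdigit()
        else:
            mask[token_id] = token_str.isdigit() or (
                token_str.count(".") == 1
                and all(c.isdigit() or c == "." for c in token_str)
            )
    return mask

# A create_proposal-style step would then apply the mask to the logits:
# logits[~float_mask(generated_text, vocab)] = -float("inf")
```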

We can also add constraints on the generated floats:

  • Enforce a given precision
  • Add an upper or lower bound on the value of the float

We will probably need SMC (sequential Monte Carlo) sampling to implement these constraints.
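For reference, a precision constraint can be written as a regular expression and enforced with the same token-masking machinery, while a bound on the value is harder to express character by character, which is presumably where SMC would come in. A minimal sketch of the precision case, using an assumed pattern for at most two decimal places:

```python
import re

# Assumed pattern: optional sign, integer part, at most two decimal places.
TWO_DECIMALS = re.compile(r"[+-]?\d+(\.\d{1,2})?")

assert TWO_DECIMALS.fullmatch("3.14") is not None
assert TWO_DECIMALS.fullmatch("3.141") is None
```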

rlouf added the text (Linked to text generation), enhancement, and transformers (Linked to the `transformers` integration) labels, then removed the transformers label, on Jun 21, 2023
@brandonwillard (Member) commented

If we're talking about regex-driven float parsing, we can also use the vocabulary pre-processing approach described in #131 and this Gist. That could possibly turn the process of determining valid next tokens (and/or the respective indices to be masked) into something closer to a simple dict look-up.
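For concreteness, a toy version of that pre-processing, with a hand-rolled DFA for a simplified float pattern and a made-up vocabulary (the real thing would be built from the tokenizer's vocabulary and the FSM compiled from the regex), might look like:

```python
from typing import Dict, List, Optional

# Hypothetical toy vocabulary; in practice this is the tokenizer's vocabulary.
VOCAB = {0: "1", 1: "23", 2: ".", 3: ".5", 4: "a", 5: "4."}

# Hand-rolled DFA for the simplified pattern [0-9]+(\.[0-9]+)? :
# state 0: start, 1: in integer part, 2: just consumed '.', 3: in fractional part.
def step(state: int, char: str) -> Optional[int]:
    if char.isdigit():
        return {0: 1, 1: 1, 2: 3, 3: 3}[state]
    if char == "." and state == 1:
        return 2
    return None  # no valid transition

def build_index(vocab: Dict[int, str], n_states: int = 4) -> Dict[int, List[int]]:
    """For every DFA state, precompute the token ids whose characters can all be consumed."""
    index: Dict[int, List[int]] = {s: [] for s in range(n_states)}
    for token_id, token_str in vocab.items():
        for start_state in range(n_states):
            state: Optional[int] = start_state
            for ch in token_str:
                state = step(state, ch)
                if state is None:
                    break
            if state is not None:
                index[start_state].append(token_id)
    return index

# Determining the valid next tokens for a partial generation then reduces to a
# dict look-up on the DFA state reached by the text generated so far.
print(build_index(VOCAB))  # {0: [0, 1, 5], 1: [0, 1, 2, 3, 5], 2: [0, 1], 3: [0, 1]}
```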

@brandonwillard (Member) commented

Looks like the current float masking occasionally produces strings like ".801.4" in test_hf_transformers.test_type_float: see the CI failure in a run of #172 here.

The FSM-based pre-processing approach mentioned above and utilized in #166 should fix that.
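A quick sanity check of why walking the pattern's FSM rules that string out (using an assumed float regex, not the exact one from the linked PRs):

```python
import re

# Assumed float pattern: optional sign, then e.g. "123", "123.45", or ".45".
FLOAT = re.compile(r"[+-]?(\d+(\.\d*)?|\.\d+)")

assert FLOAT.fullmatch(".801") is not None
# The second '.' has no valid transition, so an FSM-driven mask would never
# allow a token that produces it.
assert FLOAT.fullmatch(".801.4") is None
```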

rlouf linked a pull request on Jul 11, 2023 that will close this issue