Add MIR_scan_string_s #341

iacore · 2023-06-04T19:46:33Z

Pass all test.

I'm not sure what the best API would be. At least MIR_scan_string_s should be safer.

More improvement opportunities: Once it is sure that no \0 is in input string, maybe other code can be simplified too?

vnmakarov · 2023-06-06T20:00:28Z

I am not sure about this PR. I am trying not to change API w/o significant advantages. But I'll think more about this.

rofl0r · 2023-06-06T20:35:54Z

it would appear this PR makes code slower for no good reason other than to appear as "safe" as Annex K

iacore · 2023-06-07T16:29:38Z

I made this PR because I don't want to append the final \0 for mmaped files.

\0 can appear inside a normal file. The current parser stop parsing at first \0. Maybe it should be an error, or ignore the \0?

snej · 2023-07-31T17:45:39Z

Not all strings are nul-terminated. Raw text files are a common case, as mentioned above. And this comes up a lot in FFI: I’ve worked with several languages whose strings are passed into C as unterminated (pointer, length) pairs. IIRC both Go and Python do this. And then there’s C++’s std::string_view which can hold arbitrary substrings.

If the overhead of calling strlen is a problem, that could be removed; instead, just keep the nul check, and set the max len to a huge value in the existing function so the nul byte will be hit first. (There’s no valid reason to have a 00 byte in either ASCII or UTF-8 text.)

On the other hand, I see the string API as mostly for debugging, so is it really important to save the overhead of copying the text into a nul-terminated buffer?

iacore · 2023-11-30T19:26:25Z

If the overhead of calling strlen is a problem, that could be removed; instead, just keep the nul check, and set the max len to a huge value in the existing function so the nul byte will be hit first. (There’s no valid reason to have a 00 byte in either ASCII or UTF-8 text.)

I have applied the suggestion in the last commit.

iacore added 2 commits June 4, 2023 19:43

Add MIR_scan_string_s

8030a18

Patch MIR.md

fd8117a

apply suggestions

a332e96

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MIR_scan_string_s #341

Add MIR_scan_string_s #341

iacore commented Jun 4, 2023 •

edited

Loading

vnmakarov commented Jun 6, 2023

rofl0r commented Jun 6, 2023

iacore commented Jun 7, 2023

snej commented Jul 31, 2023

iacore commented Nov 30, 2023

Add MIR_scan_string_s #341

Are you sure you want to change the base?

Add MIR_scan_string_s #341

Conversation

iacore commented Jun 4, 2023 • edited Loading

vnmakarov commented Jun 6, 2023

rofl0r commented Jun 6, 2023

iacore commented Jun 7, 2023

snej commented Jul 31, 2023

iacore commented Nov 30, 2023

iacore commented Jun 4, 2023 •

edited

Loading