
feat: versioned protobufs #32

Merged · 18 commits · Mar 7, 2023
Conversation

tomasciccola (Contributor)

No description provided.

@sethvincent (Contributor)

It looks like the tests are failing because schemasPrefix.js needs to be updated to use the { dataTypeId, schemaVersion } object.
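For reference, a minimal sketch of the shape being asked for (the id and version are copied from the diff later in this thread, so treat them as illustrative):

// schemasPrefix.js — before
Observation: '71f85084cb94',
// schemasPrefix.js — after
Observation: { dataTypeId: '71f85084cb94', schemaVersion: 5 },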

@tomasciccola (Contributor, Author)

> It looks like the tests are failing because schemasPrefix.js needs to be updated to use the { dataTypeId, schemaVersion } object.

ooops, yeah, I forgot to commit that change...

]
.encode(record)
.finish()
const partial = ProtobufSchemas[key].fromPartial(record)
tomasciccola (Contributor, Author)

  • Using fromPartial to initialize lists to an empty list (since optional repeated doesn't exist) means initializing every field that isn't passed to its default value, which means required fields that are missing don't get caught...
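A small sketch of the failure mode, using the fromPartial/encode calls from the snippet above (the record shape and the 'refs' field are hypothetical):

const record = { lon: 0.5 } // suppose 'refs' is required but was omitted
const partial = ProtobufSchemas[key].fromPartial(record)
// fromPartial fills partial.refs with its default ([]), so this encodes
// without complaint and the missing required field is never caught:
const buf = ProtobufSchemas[key].encode(partial).finish()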

Contributor

I think we could catch this before it gets to the encode step. Is this only applicable to required arrays?

* added optional fields to filter and preset protobufs
* added field protobuf and schema
* type is optional on Preset_1 so it needs to be made optional on the base type (and deleted before validation)
* schemaVersion is optional on Field_1 so it needs to be made optional on the base type (and deleted before validation)
* this means utils.js/formatSchema{Key,Type} now takes optional values
}
}]
},
"type": {
tomasciccola (Contributor, Author)

  • in Field_1, type doesn't mean "which type of document am I?" but an enum of possible fields :|. This means Field_1 can't be validated yet (thinking about ways to avoid this issue...)

Contributor

Let's make an issue for this and come back to it. In the short term I think we might not need to validate Field, but we likely will eventually.

The only solution to this I can think of so far is renaming our type field for validation to something like schemaType so that it matches schemaVersion and is less likely to conflict with type used as a property name.
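To illustrate, a sketch of a hypothetical Field document under that rename, with type left free for the field's own enum:

const doc = {
  schemaType: 'Field', // was: type
  schemaVersion: 1,
  type: 'select_one', // Field_1's own enum of possible field kinds
}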

tomasciccola (Contributor, Author)

Yeah, I think we can make that change for new schemas, but for old schemas the type still mostly refers to schemaType, so we will still need to handle that manually.

Tomás Ciccola added 2 commits February 22, 2023 15:24
Observation: '71f85084cb94',
const schemasPrefix = {
Observation: { dataTypeId: '71f85084cb94', schemaVersions: [5] },
observation: { dataTypeId: '70f85084cb94', schemaVersions: [4] },
tomasciccola (Contributor, Author)

Casing on a doc type shouldn't be handled specifically by the code. Nevertheless, some schemas enforce a type to a specific value (type must be e.g. 'observation', which is different from 'Observation'). This means that type "observation" and "Observation" should have different dataTypeIds. That way we know what "type" to fill in when returning the decoded object to be validated...
If we want the same dataTypeId for 'observation' and 'Observation' (since we know we're actually talking about roughly the same type of 'document'), we could format the schemasPrefix object differently, like:
Observation: {dataTypeId: '924892', versions: {'observation': 4, 'Observation': 5}}
or something less awkward...
@sethvincent @gmaclennan
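For illustration, a sketch of that alternative shape and a lookup against it (the id is the placeholder from the comment above):

const schemasPrefix = {
  Observation: {
    dataTypeId: '924892',
    versions: { observation: 4, Observation: 5 },
  },
}

// given the raw type string from a decoded document:
const { versions } = schemasPrefix.Observation
const schemaVersion = versions['observation'] // -> 4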

Contributor

Ah, yeah that's kinda tricky to have to check for both.

I think ideally it would stay lowercase but maybe that's awkward for code generation? I'm not quite familiar enough with the code generation to know if it could be a matter of capitalizing the first letter somewhere in the script.

tomasciccola (Contributor, Author)

I mean, from the point of view of code generation, that would mean changing the schemas and how they are compiled, basically setting all 'type' fields to a capitalized version or a lowercase version. There are some schemas that don't have a type field, or where the type field is used for something completely different, and we would need to handle that manually.
I don't think that would be a good approach.
In terms of complexity, the current way is simpler IMO.
I don't think there's a non-weird approach though...

sethvincent (Contributor) · Mar 7, 2023

I guess I'm not sure why observation changed to Observation. If that's a needed change we should make sure other schemas follow the same pattern: coreOwnership would become CoreOwnership, and filter would become Filter, for example.

What if we had something like this and handled the little o to big O within this module? (For example, transform the lowercase o to a capital and match based on the schema version.)

  Observation: { dataTypeId: '71f85084cb94', schemaVersions: [4, 5] },

I think keeping the same dataTypeId for the two schemas makes sense (in part because v5 is just an improvement on v4).
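Something like this, perhaps (a sketch; the capitalize helper is an assumption about where the normalization would live):

const capitalize = (str) => str[0].toUpperCase() + str.slice(1)

const schemasPrefix = {
  Observation: { dataTypeId: '71f85084cb94', schemaVersions: [4, 5] },
}

// 'observation' (v4) and 'Observation' (v5) resolve to the same entry,
// and the schema version disambiguates which protobuf to use:
const entry = schemasPrefix[capitalize('observation')]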

@sethvincent (Contributor)

This is looking good!

There's a set of potential problems around type naming and exporting that I've been thinking about today.

One issue I'm seeing: in cases where someone imports both the protobuf type and the jsonschema type in one file, there might be naming conflicts.

It might be ideal to have a naming convention like:

  • CoreOwnershipJson
  • CoreOwnershipProtobuf

Kinda related: it looks like the protobufs export names like CoreOwnership_1 rather than CoreOwnership.

I have mixed feelings on whether the public types should include the version in the name. The main place that might matter is in migration code where it's needed to be explicit about versions and use multiple versions in the same code.

Ideally we could import latest version as just the name:

import { ObservationJson, ObservationProtobuf } from 'mapeo-schema/types'

As well as being more specific about the version needed:

import { ObservationJson4, ObservationJson5 } from 'mapeo-schema/types'

A type without a version would be the latest version. I can imagine there's a good argument for always including the version number and not having the convenience of importing the latest version by just the schema name.

Those examples also assume a simplified way of importing the types from mapeo-schema/types.

This would be nice especially for our jsdoc typing where long paths get messy looking:

/**
* @property {import('mapeo-schema/types').ObservationJson} Observation
*/

versus:

/**
* @property {import('mapeo-schema/types/schema/observation/v5').Observation} Observation
*/

Importing types in this way would likely require generating an index.js/index.d.ts in the `types` directory and using the [`exports` field in package.json](https://nodejs.org/api/packages.html#package-entry-points) to map `/types` in the import string to the `./types/index.js` file. (I'm pretty sure this is how the above import examples could be achieved, though I haven't double-checked.)
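As a rough sketch, a generated types/index.js could re-export each version plus the bare name (all paths and names here are hypothetical), with "./types": "./types/index.js" added to the exports field:

// types/index.js (generated)
export { Observation as ObservationJson4 } from './schema/observation/v4.js'
// the bare name points at the latest version:
export {
  Observation as ObservationJson5,
  Observation as ObservationJson,
} from './schema/observation/v5.js'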

I think the feedback in this comment could be broken up into issues that we tackle in follow-up PRs so that the versioned protobufs work can be merged.

@tomasciccola (Contributor, Author)

> Kinda related: it looks like the protobufs export names like CoreOwnership_1 rather than CoreOwnership.

Protobufs are all part of the same package (mapeo), and it doesn't allow two Message types with the same name; that's why the version is encoded in the type name. But we could do something with code generation where we rename the export (maybe with a union of every version, defaulting to the last one?)
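A sketch of what that codegen tweak could emit (the module layout and the second version are invented for illustration):

// types/proto/index.js — hypothetical generated shim
export { CoreOwnership_1 } from './coreOwnership/v1.js'
export { CoreOwnership_2 } from './coreOwnership/v2.js'
// the unversioned name defaults to the latest version:
export { CoreOwnership_2 as CoreOwnership } from './coreOwnership/v2.js'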

@tomasciccola (Contributor, Author)

@sethvincent do you want me to merge this already? I'll create additional issues to handle the pending things from your comments.

@sethvincent (Contributor)

@tomasciccola sounds good!
