Parsing compose IDs is (practically) impossible if `-` is allowed in product short name

I'm not sure if this is an issue or a cry for help...:)

Consider these compose IDs (one a real Fedora compose ID, one a made-up layered product compose ID from the test suite, since I don't know where to find a real RHEL layered product compose ID):

`Fedora-Atomic-25-20170205.0` (product short name: `Fedora-Atomic`)
`F-22-updates-BASE-3-updates-20160622.n.0` (product short name: `F`)

Now I read the code, all kinds of other subtleties are possible, which makes parsing compose IDs just super fun. But you could, I think, cope with many if not all of them by counting `-` characters. Maybe. I don't know. Life is awful.

However, as long as `-` characters are allowed in the product short name, all bets are off, because it just becomes entirely impossible to figure out which `-`s are being used as some kind of field separator by productmd, and which are just part of the product name (as in the first example).

I've been doing this to parse compose IDs in Fedoraland:

    (name, version, _) = compose_id.rsplit('-', 2)
    (date, typ, respin) = productmd.composeinfo.get_date_type_respin(compose_id)

The `rsplit` trick is to handle there being dashes in the shortname, and it relies on there being a known number of dashes after the shortname. But now I look at the test cases and code upstream, it's clear this is rather specific to my use cases and is relying on the fact that we don't have layered products (or at least, if we do, I don't care about any of 'em), or any `type_suffix`es (at least ones with dashes in them? I don't really know what these are), and I certainly don't care about RHEL 5 composes. But clearly someone does care about all those cases, so putting a generic compose ID parser in productmd (which is what I was trying to do) appears to be really rather futile, or at the least, would be very complex and have quite a bit of magic knowledge in it.

So...I don't know what to do. Would it be worthwhile making it a rule that product short names can't have dashes in them, and changing Fedora's configurations appropriately (to use `FedoraAtomic`, `FedoraCloud` and `FedoraDocker` as the short names, or something)? Or should I just give up on generically parsing compose IDs and rejig everything I have that does it, somehow (probably by requiring a round trip to the compose location or to PDC to fetch the full compose metadata, unfortunately)? Currently we have various things that get useful information just by parsing the compose ID because that's the only thing they get from a fedmsg they parse, and it's nice to be able to avoid a round trip. Doing this kind of rejigging for fedfind would also be rather tricky architecturally, but that's more or less my problem and not yours...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Parsing compose IDs is (practically) impossible if `-` is allowed in product short name #77

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Parsing compose IDs is (practically) impossible if - is allowed in product short name #77

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Parsing compose IDs is (practically) impossible if `-` is allowed in product short name #77