Skip to content

Robust :and parser, add :andn #1182

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 22 commits into
base: master
Choose a base branch
from

Conversation

frenchy64
Copy link
Collaborator

@frenchy64 frenchy64 commented Mar 28, 2025

Close #1166
Close #1173

This tightens up -parser for :and in several ways.

The essential insight is that there are two kinds of parsers, I'm calling transforming (e.g., :orn, -collection-schema) and simple (e.g., :any, -simple-schema). Simple parsers return identical input on success. Everything else is transforming.

In all cases I've seen so far, it's possible to accurately predict whether a parser is simple based on its schema. With this information, we can now improve :and's parser by:

  1. banning more than one transforming parser per :and
  2. running the transforming parser last
  3. running the transforming unparser first

This automatically handles [:and S [:fn ..]] and makes it more robust, as :fn is now passed the input value instead of the parsed value and the conjuncts can be in any order.

Extras:

Adds a new schema :andn for when you really want multiple transforming parsers in a conjunction. It reparses the input for each conjunct and returns in a Tags. Unparser only unparses the leftmost child, which enables users to transform the unparsed results by removing the other results.

We can now more aggressively optimize simple (un)parsers upfront to not build a result when it will be identical to the input.

Includes a fix for #1173 by bumping up the :max-tries for generating distinct vectors.

@frenchy64 frenchy64 marked this pull request as ready for review March 28, 2025 23:23
@frenchy64 frenchy64 changed the title WIP: Robust :and parser, add :andn Robust :and parser, add :andn Mar 28, 2025
@frenchy64 frenchy64 requested review from opqdonut and ikitommi March 28, 2025 23:23
Copy link
Member

@opqdonut opqdonut left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some questions / comments. Not confident enough yet to approve.

; :flat [#malli.core.Tag{:key :name, :value "x"}
; #malli.core.Tag{:key :id, :value 1}
; #malli.core.Tag{:key :name, :value "y"}
; #malli.core.Tag{:key :id, :value 2}]}}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah, this behaviour makes sense, I get it

(if (-ref-schema? this)
(-parser-info (-deref this))
(when (-> this -parent -type-properties ::simple-parser)
{:simple-parser true}))))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what's the need for both ParserInfo and -type-properties ::simple-parser?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could get away with just ParserInfo. It seemed neater at the time to have it at the type-level for trivial types.

(defmethod accept :orn [_ s children _]
(let [children (map last children)
base (-base s children)]
(assoc base :x-anyOf children)))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for adding this missing case!

(reduced ::invalid)
(cond-> acc
(not simple) (conj v')))))
(if simple x []) x)]
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't understand the changes around here. Is there a corresponding test?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since we know the child has a simple parser, we don't need to rebuild the result, since it means the results of parsing is either ::invalid or x.

The "then" branch of malli.parser/ensure-parser-type tests this. For example, [:vector ::HOLE] with ::HOLE being a simple parsing schema like :any is expected to be a simple parser (expected-simple == true) so any mg/samples we take of it will {un}parse back to the identical sampled value.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

aha, this is the optimisation you mention in the PR description, right?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, specifically for :map-of, -collection-schema, and :map's default value.

@opqdonut
Copy link
Member

opqdonut commented Apr 1, 2025

Letting @ikitommi have a look as well.

@frenchy64
Copy link
Collaborator Author

Thanks for looking @opqdonut.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants