-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Structure matching syntax #95
base: master
Are you sure you want to change the base?
Conversation
I think that we should leave the proposed |
From my POV, the thing that sunk the previous RFCs were exactly the questions that the That's the reason this is not an implementation RFC, by the way. The intention is that we align on overall direction first, and then implement in parts, as needed. To quote Andy from the previous thread:
It's worth noting that this RFC already punts on many of the integration problems we found, such as how destructuring assignments should work (or if they should work, given myself and Hunter don't know of any languages that support this). If we punt on substantially more than we do already, we'll just be back to the old RFC, at which point I don't know what different result we expect to find. So we need to decide on what our approaches will be for those facets of the feature, hence all of the extra syntax above what we already agreed upon. Whether we decide to support each feature or drop them from the spec, the decision must be documented, not deferred. |
Dropping the |
Coming into this a bit late I guess, but wanted to just ask some questions. Is there some issue with copying the JavaScript syntax wholesale? It's quite well thought out and tried and tested. For example one question I have is, why is it necessary to have a dot prefix for the destructured keys instead of:
|
I don't entirely disagree with that idea personally, but IIRC others shot this down already. The main trouble comes from the fact that array literals would look the same as keyed table destructuring: -- this would not work as expected, despite looking visually similar
local { foo, bar } = { "foo", "bar" } This was considered a showstopper for that syntax in particular. Beyond that, the square brackets idea has been floated a few times, but each time there's generally some disapproval since the only other time Luau uses square brackets is for indexing; we dont use I think JS has a well designed syntax for JS, but in the context of Luau there's multiple arguments against some of its choices. As for: local foo, bar, ...rest = t This would be backwards incompatible, as Luau supports multiple returns. Today, this runs: local foo, bar, baz = { "hello" }
print(foo, bar, baz) --> { [1] = "hello" } nil nil |
I very much prefer something along the lines of what @rihok suggested at the very bottom of the code sample, possibly something that outlines similarly to: local KeyedTable = {
Hello = true;
[5] = false;
[Instance] = 5;
}
local PackedTable = table.pack(1, 2)
local NestedTable = {
Nested = {
World = true;
};
}
local [ ["Hello"] = A, [5] = B, [Instance] = C ] = KeyedTable
local [ [1] = D, [2] = E, ["n"] = F ] = PackedTable
local [ ["Nested"] = [ ["World"] = G ] ] = NestedTable I assume square brackets here would be an understandable extension of the Luau language given that square brackets are only used for indexing (a point illustrated by @dphblox) and table destructuring is simply an abstraction of that. A few things to touch on surrounding this:
local [ Hello = A, [5] = B, [Instance] = C ] = KeyedTable
local [ Nested: [ World: G ] ] = NestedTable
local something = someTable
[ SomeIndex: GlobalVariable ] = someTable This has the same behavior as parentheses do in the language and results in roughly the same error contextually. This shouldn't be seen as a drawback and instead a reinforcement of the languages already well established behavior. I don't know what the implications are on parser complexity in regards to this or whether or not this fits within the purview/scope of this RFC. But I believe it's a fun and reasonable suggestion purely in terms of syntax. |
I would be somewhat OK with this, but it's a little weird that we would use square brackets both to surround the keys and to surround the whole matcher. Perhaps it would help with the "array problem": local [ these, are, keys ] = data
local { these, are, consecutive, values } = data But this would still be inverse of JS, which could be a stumbling block. IIRC, this is part of what caused the original RFC to fail, hence the dot prefixes. The older RFC has more details on that whole thing.
Colons are used for type annotations, so it would be almost certainly inappropriate to use them here. |
Here, let me throw out a few crazy syntax ideas, specifically trying to figure out how best to make array/dict difference obvious: local {in these, are, keys} = data
local {...these, are, consecutive, values} = data local in {these, are, keys} = data
local ...{these, are, consecutive, values} = data |
I think I'll try and write down my full chain of thoughts at the moment around the array and reassignment comments from the old RFC, as well as some syntax comments more generally. Take all this syntax with a pinch of salt - it's just to illustrate. Unlike JS, Luau allows you to shuttle around multiple values at once, making unpacking the natural paradigm for array destructuring: local these, are, consecutive, values = table.unpack(data, 1, data.n) So if we were to suggest any syntax there, it really feels like it ought to be expression-side, not assignment-side: local these, are, consecutive, values = ...data Naturally I would be inclined to extend that to keys (thus making this RFC redundant), but that introduces the problem of having to specify the keys twice in almost all situations: local these, are, keys = ...data["these", "are", "keys"] That's the grounds upon which I think the original destructuring RFC was made, and probably what we should focus on solving. That also aligns with the discussions we've had above. Trying to make this work for arrays seems misguided and I don't reckon it's even relevant. Now onto syntax. One of my earlier explorations was having a "destructuring assignment", though I was concerned about it looking too much like a compound assignment, or like it's unpacking values positionally: local these, are, keys ...= data In my eyes, the problem there is that throughout the rest of Luau, the names of those identifiers are not significant - only their positions are. Therefore, if the names are going to be significant, it's worthwhile to alter how those names are presented to make that clear. That's what the original RFC did. I'd say dot prefixes seem like a sensible and low-syntax way to indicate those names are keys. local .foo, .bar, .baz = data Of course, without the local thing
local function usesThing()
task.delay(1, function()
print(thing)
end)
end
local otherThings, foo, bar
.thing, .otherThings, .foo, .bar = getThings() Forward declaration is a little bit of a code smell IMO (though not always!), it feels unintuitive for people to destructure in this position, and JS doesn't support destructuring when assigning - only when declaring - so I would be personally OK dropping this requirement, though I'm not the person who brought it up in the first place. So all of that is my bias coming into this. I'm not saying we should do any of the above, but that's the general direction I lean in philosophically. My aim is to try and enmesh that with the previous RFC and the syntax everyone seemed to be OK with at the time. |
I cleaned up the proposal - I would consider this to be the apex of what an agreeable 100%-brace-based syntax would look like. To summarise, here's what's left at this point:
Things that have been explicitly rejected;
There's some other suggestions in this thread for syntaxes based on brackets (e.g. |
All this being said, I'm personally less convinced that we even need everything that's in this RFC as it stands. Given the direction the discussions have drifted towards, combined with the practical examples that've arose, I struggle to see much of the use case beyond simple named destructuring. There's an argument to be made that we should elide much of the complex concerns and stick to something much simpler to retain the spirit of the Luau language. Much of what's in here is my attempt at serving as many use cases as possible, but if we wanted to get most of the value with less complexity, we could just do something like local .foo, .bar in data and accept that we'll never have a full destructuring system, primarily because we'll never need one. Array destructuring has |
Check out the conversation about that (. We shouldn't do this because, AFACT, all other languages that implement some sort of pattern matching opt to mirror their construction / destruction syntax.
I chatted about this offline with some folks (mainly @aatxe): destructuring syntax / pattern matching in general is a power user feature. The unfortunate part is that it's also the principled way to do named imports without adding a feature for it, such as JavaScript imports. I wonder if we should support exactly one syntax to start:
No nested syntax, no support for custom indexers, no support for array-like members. As-is I think the |
Yeah this is part of the bikeshedding we need to figure out still. I just tried to match the mood of some other comments but don't personally have a strong position on which I'd prefer. I will say that this feels unclear to me: local { foo = bar } = getData() It isn't obvious which side is the identifier and which side is the key. Since this isn't an ambiguous position, a contextual keyword would help clarify this and give a natural "orientation" to the statement. I also think it's more "Luau-like" to use keywords over symbols. -- for demonstration purposes
local { foo as bar } = getData()
local { foo in bar } = getData() But the ambiguity could get resolved by symbols too, just more noisily: -- for demonstration purposes
local { foo -> bar } = getData()
local { foo = .bar } = getData()
local { .foo = bar } = getData()
local { [foo] = bar } = getData()
-- etc... I think I'll just defer to the better judgement of the team for this one, but my personal preference would be |
Yeah this is the way I'm leaning at this point too. I might pare down the RFC to literally just this. I'm happy we spent the time to explore the other options if only so we could figure out why not to pursue them. |
Alright, I've whittled down the RFC to just the basic feature as suggested. Happy to integrate further thoughts if desired :) |
Nice whittle! Could we also note syntax like local .foo, .bar in data
local .foo, .bar = data
local foo, bar in data in the alternatives section? In drawbacks, maybe note that As an another alternative, we could generalize Since For example: local x = .x in somevector
assert(x == somevector.x)
-- w/ syntax sugar:
local .x in somevector
local .x, .y in somevector
type Profile = {
ip: string,
hostname: string,
last_seen: DateTimeUtc,
}
type DateTimeUtc = {
unix_timestamp: number,
format: (self: DateTimeUtc, fmtstring: string) -> string,
}
local profile: Profile
local ip, hostname, timestamp = .ip, .hostname, .last_seen.unix_timestamp in profile
-- w/ sugar
local .ip, .hostname, .timestamp.unix_timestamp as timestamp in profile -- for demonstration purposes; `as` might not be the right choice here `as` usual |
I'm not against the idea of a braceless syntax, but I do have one or two concerns with it. Firstly, in the case of something like React props, is it nice to format long lists of identifiers when there's no block implied by braces? local
position,
size,
bgColor,
textColor
onClick,
onHover,
onFocus in props versus local {
position,
size,
bgColor,
textColor
onClick,
onHover,
onFocus
} in props Secondly is that it can get weird if we use keywords for both renaming and assignment: local foo, bar in baz in data versus local { foo, bar in baz } in data And thirdly is that I wonder how scalable the pattern is to other places in Luau where matching structure may be desirable, for example the original RFC mentions function arguments: local function foo(
foo, bar in baz
) versus local function foo(
{ foo, bar } in baz
) or even, if we allow anonymous parameters: local function foo(
foo, bar
) versus local function foo(
{ foo, bar }
) So for the time being I'd rather stick to a brace-based syntax for the benefits of block delineation. Open to discuss whether |
Gathering consensus on the destructuring syntax inside of the braces (i.e. punting on other issues explicitly), so we can align on this with the community.
Rendered.