New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GH-98831: Support cache effects in super- and macro instructions #99601
Conversation
If you want to schedule another build, you need to add the " |
This once again refactors a lot of the code generator. There are still cleanups to be done, and I'd like things to be more compact, but I think most of the refactoring is now done.
sup: SuperInstruction mac: MacroInstruction super: Super macro: Macro
I tried to split it into InstDef and OpDef, removing kind, but that caused problems because the Instruction class inherits from InstHeader.
@brandtbucher If you want larger diffs I can send you a later version that also updates a bunch of instructions ( |
As before, a later version passed all the buildbot tests and I am confident this one would too. |
if dedent != 0 and tkn.kind == 'COMMENT' and '\n' in text: | ||
if dedent < 0: | ||
text = text.replace('\n', '\n' + ' '*-dedent) | ||
# TODO: dedent > 0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Leaving this TODO
for future work?
As a minor nit: the double-negative of a "negative dedent" is sort of strange to me. I'd personally find it easier to reason about "indents" and "negative indents"... but I'm not sure if that makes things more difficult elsewhere.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I inherited that from the lexer. I'll eventually fix it. For a while it was inconvenient because there was also a variable named indent
in a lot of places. That's now self.indent
. (But still, it's a string of spaces rather than an int.)
while self.expect(lx.PLUS): | ||
if tkn := self.require(lx.IDENTIFIER): | ||
ops.append(tkn.text) | ||
self.require(lx.SEMI) | ||
if op := self.op(): | ||
ops.append(op) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does this allow a single op followed by a bunch of +
s? It looks like it might...
if (tkn := self.expect(lx.IDENTIFIER)) and tkn.text == "macro": | ||
if self.expect(lx.LPAREN): | ||
if tkn := self.expect(lx.IDENTIFIER): | ||
if self.expect(lx.RPAREN): | ||
if self.expect(lx.EQUALS): | ||
if uops := self.uops(): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hmmmm. I know you prefer one test per if
, but I think six nested if
s is pushing it for generated code, let alone human code. Maybe one test per line?
if (tkn := self.expect(lx.IDENTIFIER)) and tkn.text == "macro": | |
if self.expect(lx.LPAREN): | |
if tkn := self.expect(lx.IDENTIFIER): | |
if self.expect(lx.RPAREN): | |
if self.expect(lx.EQUALS): | |
if uops := self.uops(): | |
if ( | |
(tkn := self.expect(lx.IDENTIFIER)) | |
and tkn.text == "macro" | |
and self.expect(lx.LPAREN) | |
and (tkn := self.expect(lx.IDENTIFIER)) | |
and self.expect(lx.RPAREN) | |
and self.expect(lx.EQUALS) | |
and (uops := self.uops()) | |
): |
But consistency within the file is probably more important.
else: | ||
return CacheEffect(tkn.text, size) | ||
raise self.make_syntax_error("Expected integer") | ||
else: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is another case where the deep nesting impairs readability: with the naked eye, it's pretty hard to tell which if
this else corresponds to.
DEFAULT_INPUT = os.path.relpath( | ||
os.path.join(os.path.dirname(__file__), "../../Python/bytecodes.c") | ||
) | ||
DEFAULT_OUTPUT = os.path.relpath( | ||
os.path.join(os.path.dirname(__file__), "../../Python/generated_cases.c.h") | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorta defeats the purpose of os.path.join
... ;)
DEFAULT_INPUT = os.path.relpath( | |
os.path.join(os.path.dirname(__file__), "../../Python/bytecodes.c") | |
) | |
DEFAULT_OUTPUT = os.path.relpath( | |
os.path.join(os.path.dirname(__file__), "../../Python/generated_cases.c.h") | |
) | |
DEFAULT_INPUT = os.path.relpath( | |
os.path.join(os.path.dirname(__file__), os.pardir, os.pardir, "Python", "bytecodes.c") | |
) | |
DEFAULT_OUTPUT = os.path.relpath( | |
os.path.join(os.path.dirname(__file__), os.pardir, os.pardir, "Python", "generated_cases.c.h") | |
) |
for i, var in enumerate(reversed(up.stack[: up.final_sp]), 1): | ||
self.out.emit(f"POKE({i}, {var});") | ||
|
||
self.out.emit(f"DISPATCH();") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
self.out.emit(f"DISPATCH();") | |
self.out.emit("DISPATCH();") |
Yeah, it would be nice if instead of
we could write
I had originally thought that
But
I only came up with that while writing this reply, I'll have to play with it to see if it'll work. |
Note: this doesn't yet support cache effects in the
macro
definition itself (e.g.macro(X) = counter/1 + FOO + BAR + unused/2
). That seems something for a follow-up (we don't even have a use case for the current thing yet).Details
(From two original commits that were since merged.)
Super-instructions can now have cache effects.
To do this I mostly just had to move the cache effects code into
Instr*.write_body()
, reducing the responsibilities ofInstr*.write()
.(I also had to fiddle a bit with indents.)
For macros the same approach would almost work, except that
next_instr
might point in the middle of the cache when we encounterDEOPT_IF()
orERROR_IF()
in a second or further component.I have to think more about that.
NOTES:
super-instruction).
This shouldn't matter.
Macro instructions can now also have cache effects.
We pass the initial cache offset into write_body().
This is a little fiddly because everything is different
for super-instructions vs macros:
bump
next_instr
after each op.and we bump
next_instr
at the end.Also, I had to move the bump of
next_instr
back intoInstr*.write()
.It is better placed there anyway because that function avoids the bump
if the C code already ends in a
goto
,return
orDISPATCH*()
call.(The previous commit emitted one unreachable bump, which is now fixed.)
Tested manually.
NOTES
There's more refactoring coming.
Also included:
to_text()