Commits · c83f33a3c42177f5db754746bcc33ea08ba70d2b · Boxiang Sun / Pyston

05 Apr, 2016 5 commits

Use patchpoints to skip decref jit overhead · c83f33a3

Kevin Modzelewski authored Apr 05, 2016

Represent decref/xdecref operations as opaque functions calls.
I think the ideal solution would be to add a custom llvm intrinsic,
but I spent a small amount of time looking into that and had trouble
figuring out how to do that.

So instead, just emit them as patchpoints, and then patch them afterwards
with a fixed code sequence.

This commit only does this for decref/xdecref because:
- they occur much more frequently
- they are much more expensive to jit since they involve control flow
- forcing the op to fit a C-calling-convention isn't that much overhead,
  since the register allocator probably would have done that anyway due
  to the (potential) dealloc call.

c83f33a3

Involve instance attributes in GC · 31a03a8d

Kevin Modzelewski authored Apr 05, 2016

I copied subtype_traverse and subtype_clear from cpython but
didn't update them to support our hcattrs

31a03a8d

bjit: more refcounting fixes · c9cd0170
Marius Wachtler authored Apr 05, 2016

c9cd0170

misc fixes to get the bjit further · 8d7d24ad

Marius Wachtler authored Apr 05, 2016

- fix instruction encoding of: add imm, mem
- fix Py_REF_DEBUG when increfing multiple times at once
- add missing bjit annotations

8d7d24ad

Merge pull request #1129 from Daetalus/ref_nexedi_tuple · ede3f121
Kevin Modzelewski authored Apr 04, 2016
```
small refcounting fixing in tuple
```
ede3f121

04 Apr, 2016 9 commits

these tests are passing now · 84bee77c
Kevin Modzelewski authored Apr 04, 2016
```
I think they were timing out before
```
84bee77c

Low-tech optimization: add xdecrefAll() function · 19ed7064

Kevin Modzelewski authored Apr 04, 2016

For use on the exception path. Rather than emitting the instructions
for a bunch of decrefs, instead just emit a single call to xdecrefAll()

Improves performance quite a lot: sre_parse_parse llvm instructions goes
from 360k to 80k, and it's 60k if I manually disable cxx fixups.
Overall time [which is entirely compile time] goes from 12.5s to 4.4s,
though master does it in 1.3s so there's still some work to do. But
even if I turn off cxx fixups entirely it still takes 2.9s.

19ed7064

Generate cxx "fixups" on-demand · 21b20e3b

Kevin Modzelewski authored Apr 04, 2016

fixups aka the stubs that decref whatever's needed when an exception is thrown

I looked into this because most (75%?) of the refcounting overhead
comes from the cxx fixups. Previously we would always generate them in
the IRGenerator, regardless of whether they were needed. Now they are
generated in the refcounter, which knows whether they are needed or not.

Unfortunately it looks like they are usually needed, so the gains here
aren't that great (saves about 10% llvm instructions whereas cxx fixups
in general added about 400% more llvm instructions).

I think this is still a good change because it's also necessary in order to use
Marius's EH stuff.

I think the cost of the fixups is mostly related to the cost of the decrefs
that it adds, so even though most of the refcounting overhead seems to be due to
adding the cxx fixups, reducing general decref overhead might reduce cxx fixup overhead

21b20e3b

small refcounting fixing in tuple · b2f3653a
Boxiang Sun authored Apr 05, 2016

b2f3653a
Count the number of llvm instructions · 7b95f4d5
Kevin Modzelewski authored Apr 04, 2016

7b95f4d5

Format! · fcf746df

Kevin Modzelewski authored Apr 04, 2016

Any existing work on top of the unformatted branch might have a hard time
merging -- it should work ok to do a format on top of your changes and then
merge with this commit.

fcf746df

This test is working now · 7e955a99
Kevin Modzelewski authored Apr 04, 2016

7e955a99
Fix a few more list refcount issues · 5a03ae22
Kevin Modzelewski authored Apr 02, 2016

5a03ae22
Fix a bug in list.__index__ · 8b483912
Kevin Modzelewski authored Apr 02, 2016
```
I think this exists on master as well.
```
8b483912

02 Apr, 2016 4 commits

Merge pull request #1128 from Daetalus/refcount_long_hash · cf179f85
Marius Wachtler authored Apr 02, 2016
```
Two minor refcounting fixing in int and long functions.
```
cf179f85
add a option to disable signal checking · 4ca4c082
Marius Wachtler authored Apr 02, 2016
```
this is nice for making the generated LLVM IR and bjit code simpler when debugging code
```
4ca4c082

Fix name deleting · 87bf4d5d

Kevin Modzelewski authored Apr 01, 2016

Name deleting stores a NULL into the vregs array, and then
consumes a reference to it.  Another solution would be to change
the name deleting to not consume the reference, but this commit
changes the refcounter to handle null values better.

87bf4d5d

These are all working now · 8a0dcce8
Kevin Modzelewski authored Apr 02, 2016

8a0dcce8

01 Apr, 2016 5 commits
- Fix custom tuple subclasses · aa06c018
  Kevin Modzelewski authored Apr 01, 2016
  
  aa06c018
- minor refcounting fixing in long.__hash__ · 2c4a68a6
  Boxiang Sun authored Apr 02, 2016
  
  2c4a68a6
- minor refcounting annotation fixing in int.__abs__ · b5f3a74e
  Boxiang Sun authored Apr 02, 2016
  
  b5f3a74e
- Improvements to the refcount checker · a0471f34
  Kevin Modzelewski authored Apr 01, 2016
```
I thought I was getting close so I spent a few days on this.
But there's still a lot of work left to be done to get it to be usable.
```
  a0471f34
- Fix classobj slice crashes · ea7fe5c2
  Marius Wachtler authored Apr 01, 2016
  
  ea7fe5c2
31 Mar, 2016 6 commits

Workaround OSR issue with undefined variables · 569fbf0b
Marius Wachtler authored Mar 31, 2016
```
Just pass a increfed None in becasue the LLVM tier will always decref this value
```
569fbf0b
simplify code by reusing the helper · e4e4df2e
Marius Wachtler authored Mar 31, 2016

e4e4df2e
add missing refcounting annotation to compare() · 56d4f42e
Marius Wachtler authored Mar 31, 2016

56d4f42e
Fix bjit del crashes · b6d8fbe7
Marius Wachtler authored Mar 31, 2016

b6d8fbe7

Check CAPI exceptions before signals · 1077e7f2

Kevin Modzelewski authored Mar 31, 2016

Previously we would check signals first.  Which means that we would then
call into a signal handler with an active exception, which would later
trigger asserts.

For CXX functions, the exception automatically wins over the signal checking.
CPython also checks signals first.

The only tricky thing is that this was happening because the signals stuff
was hooked deeper down the stack.  So pass down the CAPI-exception data as well.

1077e7f2

Fix some more CAPI-only bugs · efd8ce63

Kevin Modzelewski authored Mar 31, 2016

One of them was an issue of defining a CXX-style accelerator but
not a CAPI-style one, so that's now asserted against.

efd8ce63

30 Mar, 2016 9 commits
- Fix some bugs that made None createable · 0eca1ddb
  Kevin Modzelewski authored Mar 30, 2016
```
Copy over tp_new_wrapper, which is the main thing that should be doing the check.
Our implementation was pretty much the same minus that check.

There's also a separate check that isn't completely necessary but seems like a good idea,
and we had it on certain codepaths, and whether you hit it depended on whether you were
in CAPI mode or not.
```
  0eca1ddb
- these are working · 0e2566d1
  Kevin Modzelewski authored Mar 30, 2016
  
  0e2566d1
- Fix a couple CAPI-only bugs · 298c835c
  Kevin Modzelewski authored Mar 30, 2016
  
  298c835c
- A couple more nullable values · 3b9ca8a5
  Kevin Modzelewski authored Mar 30, 2016
  
  3b9ca8a5
- Support writing out xdecrefs in the llvm jit · 896f568b
  Kevin Modzelewski authored Mar 30, 2016
```
It will end up spitting out something like:

%x = callattrCapi()
Py_XDECREF(x)
if (!x) throwCapiException()

We usually write something more like:

%x = callattrCapi()
if (!x) throwCapiException()
Py_DECREF(x)

But with optimizations turned on, llvm will turn them into
the same thing.
```
  896f568b
- fix generator_collection.py OSR · 65272d1a
  Marius Wachtler authored Mar 30, 2016
  
  65272d1a
- fix a few OSR problems with unanotted null pointers · 5d44b58e
  Marius Wachtler authored Mar 30, 2016
  
  5d44b58e
- fix closure bugs · a3f6d697
  Marius Wachtler authored Mar 30, 2016
  
  a3f6d697
- Working on intmethods.py · e14f792a
  Kevin Modzelewski authored Mar 30, 2016
  
  e14f792a
29 Mar, 2016 2 commits
- Get file_writing.py working · ea7d3f0b
  Kevin Modzelewski authored Mar 29, 2016
  
  ea7d3f0b
- These working now too · 6d18ce0a
  Kevin Modzelewski authored Mar 29, 2016
  
  6d18ce0a