- 03 Jun, 2015 1 commit
-
-
Kevin Modzelewski authored
Float pow
-
- 02 Jun, 2015 5 commits
-
-
Kevin Modzelewski authored
Add a section-ordering script
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
I think this lets us specify that certain functions should be put together at the end of the text segment. This is inspired by a similar feature of HHVM's build, though the goal for us for now is just to improve performance consistency rather than overall performance. Hopefully soon/eventually we can do profile-guided sorting like they do.
-
Kevin Modzelewski authored
Fix some issues with the way we build libunwind
-
Kevin Modzelewski authored
First, that we would only apply our patchset once. If we ever revert the patches (I'm not sure under what conditions that happens), we previously would never apply them again. Attempted to fix this by adding a special patch that adds a new file that CMake looks for; if the file doesn't exist, cmake runs the patches again. Second, that we didn't rebuild libunwind if we apply new patches. I'm not sure if there's a good general solution to this, but I was able to figure out how to force libunwind to rebuild if we need to rerun the patch command. It took some hacking since CMake doesn't track dependencies on external projects, so we have to add some custom dependencies.
-
- 01 Jun, 2015 10 commits
-
-
Kevin Modzelewski authored
remove redundant -Wno-sign-cmopare
-
Kevin Modzelewski authored
Switch hash to using tp_hash
-
Chris Toshok authored
-
Kevin Modzelewski authored
We weren't even doing any rewriting for hash, so there's not much downside. This also cuts down on boxing quite a bit since we can usually avoid boxing the hash value.
-
Kevin Modzelewski authored
Perf investigations
-
Kevin Modzelewski authored
Coming from looking into regex performance; re_compile is reduced from django-template startup, and dict_hashing_ubench is reduced from that.
-
Kevin Modzelewski authored
Also, quiet some debug output
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
use llvm::StringRef instead of std::string typeNew
-
Chris Toshok authored
turns out we allocate/free the same std::strings for every slot name, for every typeNew. instead do it once, when we create the slotdefs array. Also, use llvm::StringRefs instead of std::strings since we already have them in setattrGeneric (the other caller of update_slot.)
-
- 29 May, 2015 11 commits
-
-
Kevin Modzelewski authored
Add UTF8-BOM support, int.bit_length, function.func_doc, fix '(-1)**0'
-
Marius Wachtler authored
-
Marius Wachtler authored
-
Kevin Modzelewski authored
Python-level sampling profiler
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
Uses setitimer() to set a recurring signal, and prints a Python stacktrace at the next safepoint aka allowGLReadPreemption. This is not great since allowGLReadPreemption can happen a decent amount later than the signal. (I'll play around with trying to get the signal to be acted on sooner, but it might be better to wait for full signal-handling support.) Still, it seems to provide some decent high-level info. For example, half of the startup time of the django-template benchmark seems to be due to regular expressions.
-
Kevin Modzelewski authored
Handle list self assignment during slicing
-
Kevin Modzelewski authored
Add some parsing tests
-
Marius Wachtler authored
-
Kevin Modzelewski authored
- byte order marker - newline between decorator and its function - strings with size >64k [including a fix] also, a decimal.Decimal test
-
- 28 May, 2015 12 commits
-
-
Kevin Modzelewski authored
Add PyErr_GetExcInfo, make __builtins__ more similar to cpython and id() output more useful
-
Kevin Modzelewski authored
Convert runtime functions to take llvm::StringRef
-
Marius Wachtler authored
-
Marius Wachtler authored
-
Marius Wachtler authored
-
Marius Wachtler authored
-
Kevin Modzelewski authored
This only happens to me on the gcc build, and it apparently doesn't happen on travis-ci. Not exactly sure why that would cause the code to get linked or not, but anyway it's stuff we're not using right now so just ifdef it out.
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
Should hopefully cut down on allocations to pass around 'const std::string&' objects (since we don't always store things as std::strings anymore), or to calls to strlen if we pass around const char*s. Haven't looked yet at the calls that we embed in the llvm IR. Here are the perf results: pyston django_migrate.py : 2.3s baseline: 2.3 (-1.7%) pyston django-template.py : 15.1s baseline: 15.4 (-1.6%) pyston interp2.py : 5.3s baseline: 6.3 (-15.1%) pyston raytrace.py : 6.1s baseline: 6.2 (-0.7%) pyston nbody.py : 8.4s baseline: 8.1 (+4.1%) pyston fannkuch.py : 7.5s baseline: 7.5 (+0.2%) pyston chaos.py : 20.2s baseline: 20.0 (+0.7%) pyston fasta.py : 5.4s baseline: 5.4 (+0.3%) pyston pidigits.py : 5.7s baseline: 5.7 (+0.0%) pyston richards.py : 2.5s baseline: 2.7 (-6.2%) pyston deltablue.py : 1.8s baseline: 1.8 (-0.0%) pyston (geomean-3424) : 5.7s baseline: 5.8 (-2.0%) I looked into the regression in nbody.py, and it is in an unrelated piece of code (list unpacking) that has the same assembly and gets called the same number of times. Maybe there's some weird cache collision. It's an extremely small benchmark (a single 13-line loop) so I'm happy to write it off as microbenchmark sensitivity. We can also optimize this if we want to; we could speculate on the type that we are unpacking and inline the parts of the unpacking code we need.
-
Kevin Modzelewski authored
switch to CPython's thread._local implementation
-
Kevin Modzelewski authored
ie make sure both that it's a valid allocation, but also that it will have Python destructor semantics applied when it is freed (as opposed to, say, STLCompatAllocator-allocated memory). This is to make sure that extension modules don't use a different allocation routine than we expected. There are only a few specialized places that I could find that we actually want the old-behavior; in dump(), and in PyObject_Init right before we call setIsPythonObject. So for those cases, add a new isValidGCMemory call that doesn't do the allocation-kind check.
-
- 27 May, 2015 1 commit
-
-
Kevin Modzelewski authored
We were using a slightly different (flattened) version; not sure why.
-