cpython

mirror of https://github.com/python/cpython.git synced 2024-11-23 09:54:58 +08:00

Author	SHA1	Message	Date
Sam Gross	4759ba6eec	gh-127022: Simplify `PyStackRef_FromPyObjectSteal` (#127024 ) This gets rid of the immortal check in `PyStackRef_FromPyObjectSteal()`. Overall, this improves performance about 2% in the free threading build. This also renames `PyStackRef_Is()` to `PyStackRef_IsExactly()` because the macro requires that the tag bits of the arguments match, which is only true in certain special cases.	2024-11-22 12:55:33 -05:00
Kirill Podoprigora	27486c3365	gh-115999: Add free-threaded specialization for `UNPACK_SEQUENCE` (#126600 ) Add free-threaded specialization for `UNPACK_SEQUENCE` opcode. `UNPACK_SEQUENCE_TUPLE/UNPACK_SEQUENCE_TWO_TUPLE` are already thread safe since tuples are immutable. `UNPACK_SEQUENCE_LIST` is not thread safe because of nature of lists (there is nothing preventing another thread from adding items to or removing them the list while the instruction is executing). To achieve thread safety we add a critical section to the implementation of `UNPACK_SEQUENCE_LIST`, especially around the parts where we check the size of the list and push items onto the stack. --------- Co-authored-by: Matt Page <mpage@meta.com> Co-authored-by: mpage <mpage@cs.stanford.edu>	2024-11-22 19:00:35 +02:00
Donghee Na	78a530a578	gh-115999: Add free-threaded specialization for ``TO_BOOL`` (gh-126616)	2024-11-22 07:52:16 +09:00
mpage	09c240f20c	gh-115999: Specialize `LOAD_GLOBAL` in free-threaded builds (#126607 ) Enable specialization of LOAD_GLOBAL in free-threaded builds. Thread-safety of specialization in free-threaded builds is provided by the following: A critical section is held on both the globals and builtins objects during specialization. This ensures we get an atomic view of both builtins and globals during specialization. Generation of new keys versions is made atomic in free-threaded builds. Existing helpers are used to atomically modify the opcode. Thread-safety of specialized instructions in free-threaded builds is provided by the following: Relaxed atomics are used when loading and storing dict keys versions. This avoids potential data races as the dict keys versions are read without holding the dictionary's per-object lock in version guards. Dicts keys objects are passed from keys version guards to the downstream uops. This ensures that we are loading from the correct offset in the keys object. Once a unicode key has been stored in a keys object for a combined dictionary in free-threaded builds, the offset that it is stored in will never be reused for a different key. Once the version guard passes, we know that we are reading from the correct offset. The dictionary read fast-path is used to read values from the dictionary once we know the correct offset.	2024-11-21 11:22:21 -08:00
Mark Shannon	aea0c586d1	GH-127010: Don't lazily track and untrack dicts (GH-127027)	2024-11-20 16:41:20 +00:00
Brandt Bucher	48c50ff1a2	GH-126892: Reset warmup counters when JIT compiling code (GH-126893)	2024-11-20 08:11:25 -08:00
Hugo van Kemenade	899fdb213d	Revert "GH-126491: GC: Mark objects reachable from roots before doing cycle collection (GH-126502)" (#126983 )	2024-11-19 11:25:09 +02:00
Mark Shannon	b0fcc2c47a	GH-126491: GC: Mark objects reachable from roots before doing cycle collection (GH-126502) * Mark almost all reachable objects before doing collection phase * Add stats for objects marked * Visit new frames before each increment * Remove lazy dict tracking * Update docs * Clearer calculation of work to do.	2024-11-18 14:31:26 +00:00
Sergey B Kirpichev	d9e251223e	gh-103951: enable optimization for fast attribute access on module subclasses (GH-126264) Co-authored-by: Nicolas Tessore <n.tessore@ucl.ac.uk>	2024-11-15 16:03:38 +08:00
Ken Jin	6293d00e72	gh-120619: Strength reduce function guards, support 2-operand uop forms (GH-124846) Co-authored-by: Brandt Bucher <brandtbucher@gmail.com>	2024-11-09 11:35:33 +08:00
Donghee Na	4ea214ea98	gh-115999: Add free-threaded specialization for CONTAINS_OP (gh-126450) - The specialization logic determines the appropriate specialization using only the operand's type, which is safe to read non-atomically (changing it requires stopping the world). We are guaranteed that the type will not change in between when it is checked and when we specialize the bytecode because the types involved are immutable (you cannot assign to `__class__` for exact instances of `dict`, `set`, or `frozenset`). The bytecode is mutated atomically using helpers. - The specialized instructions rely on the operand type not changing in between the `DEOPT_IF` checks and the calls to the appropriate type-specific helpers (e.g. `_PySet_Contains`). This is a correctness requirement in the default builds and there are no changes to the opcodes in the free-threaded builds that would invalidate this.	2024-11-06 03:35:10 +00:00
Peter Bierma	1371295e67	gh-126366: Fix crash if `__iter__` raises an exception during `yield from` (#126369 )	2024-11-05 15:26:36 +05:30
Kirill Podoprigora	78015818c2	gh-126415: Fix conversion warning in `Python/bytecodes.c` (#126416 ) Fix conversion warning in bytecodes Co-authored-by: mpage <mpage@cs.stanford.edu>	2024-11-05 04:12:31 +02:00
mpage	2e95c5ba3b	gh-115999: Implement thread-local bytecode and enable specialization for `BINARY_OP` (#123926 ) Each thread specializes a thread-local copy of the bytecode, created on the first RESUME, in free-threaded builds. All copies of the bytecode for a code object are stored in the co_tlbc array on the code object. Threads reserve a globally unique index identifying its copy of the bytecode in all co_tlbc arrays at thread creation and release the index at thread destruction. The first entry in every co_tlbc array always points to the "main" copy of the bytecode that is stored at the end of the code object. This ensures that no bytecode is copied for programs that do not use threads. Thread-local bytecode can be disabled at runtime by providing either -X tlbc=0 or PYTHON_TLBC=0. Disabling thread-local bytecode also disables specialization. Concurrent modifications to the bytecode made by the specializing interpreter and instrumentation use atomics, with specialization taking care not to overwrite an instruction that was instrumented concurrently.	2024-11-04 11:13:32 -08:00
Tomas R.	aab58a93ef	gh-118423: Add `INSTRUCTION_SIZE` macro to code generator (GH-125467)	2024-10-29 17:25:05 +00:00
Mark Shannon	faa3272fb8	GH-125837: Split `LOAD_CONST` into three. (GH-125972) * Add LOAD_CONST_IMMORTAL opcode * Add LOAD_SMALL_INT opcode * Remove RETURN_CONST opcode	2024-10-29 11:15:42 +00:00
Mark Shannon	25441592db	GH-125515: Reduce number of compiler warnings in generated code (GH-125697)	2024-10-28 10:30:31 +00:00
Mark Shannon	b61fece852	GH-125868: Fix STORE_ATTR_WITH_HINT specialization (GH-125876)	2024-10-24 11:57:02 +01:00
Mark Shannon	57e3c59bb6	GH-125521: Remove `if (true)` from generated output to reduce C compiler warnings (GH-125700)	2024-10-22 10:11:29 +01:00
mpage	de5a6c7c7d	gh-121459: Fix a couple of uses of `PyStackRef_FromPyObjectSteal` (#125711 ) * Fix usage of PyStackRef_FromPyObjectSteal in CALL_TUPLE_1 This was missed in gh-124894 * Fix usage of PyStackRef_FromPyObjectSteal in _CALL_STR_1 This was missed in gh-124894 * Regenerate code	2024-10-21 11:08:13 -07:00
sobolevn	0c8c665581	gh-125470: Fix warning in `Python/generated_cases.c.h` (#125471 ) Co-authored-by: Kirill Podoprigora <kirill.bast9@mail.ru>	2024-10-14 23:46:17 +03:00
Mark Shannon	06ca33020e	GH-125323: Convert DECREF_INPUTS_AND_REUSE_FLOAT into a function that takes PyStackRefs. (GH-125439)	2024-10-14 14:18:57 +01:00
Ken Jin	4b358ee647	gh-125323: Remove some unsafe Py_DECREFs in bytecodes.c, replacing them with PyStackRef_CLOSEs (GH-125324)	2024-10-14 09:17:51 +01:00
Mark Shannon	c9014374c5	GH-125174: Make immortal objects more robust, following design from PEP 683 (GH-125251)	2024-10-10 18:19:08 +01:00
mpage	f978fb4f8d	gh-115999: Refactor `LOAD_GLOBAL` specializations to avoid reloading {globals, builtins} keys (gh-124953) Each of the `LOAD_GLOBAL` specializations is implemented roughly as: 1. Load keys version. 2. Load cached keys version. 3. Deopt if (1) and (2) don't match. 4. Load keys. 5. Load cached index into keys. 6. Load object from (4) at offset from (5). This is not thread-safe in free-threaded builds; the keys object may be replaced in between steps (3) and (4). This change refactors the specializations to avoid reloading the keys object and instead pass the keys object from guards to be consumed by downstream uops.	2024-10-09 15:18:25 +00:00
Tomas R.	6b533a659b	gh-125039: Make `this_instr`/`prev_instr` const in cases generator (GH-125071)	2024-10-09 13:54:39 +01:00
Mark Shannon	d1453f60c2	GH-121459: Streamline PyObject* to PyStackRef conversions by disallowing NULL pointers. (GH-124894)	2024-10-07 18:13:04 +01:00
Mark Shannon	da071fa3e8	GH-119866: Spill the stack around escaping calls. (GH-124392) * Spill the evaluation around escaping calls in the generated interpreter and JIT. * The code generator tracks live, cached values so they can be saved to memory when needed. * Spills the stack pointer around escaping calls, so that the exact stack is visible to the cycle GC.	2024-10-07 14:56:39 +01:00
Sam Gross	5aa91c56bf	gh-124296: Remove private dictionary version tag (PEP 699) (#124472 )	2024-10-01 12:39:56 -04:00
Ken Jin	198756b0f6	gh-117376: Fix off-by-ones in conversion functions (GH-124301) Fix off-by-ones in conversion function	2024-09-26 02:41:07 +08:00
Sam Gross	f4997bb3ac	gh-123923: Defer refcounting for `f_funcobj` in `_PyInterpreterFrame` (#124026 ) Use a `_PyStackRef` and defer the reference to `f_funcobj` when possible. This avoids some reference count contention in the common case of executing the same code object from multiple threads concurrently in the free-threaded build.	2024-09-24 20:08:18 +00:00
Ken Jin	8810e286fa	gh-121459: Deferred LOAD_GLOBAL (GH-123128) Co-authored-by: Bénédikt Tran <10796600+picnixz@users.noreply.github.com> Co-authored-by: Sam Gross <655866+colesbury@users.noreply.github.com>	2024-09-14 00:23:51 +08:00
Sam Gross	b2afe2aae4	gh-123923: Defer refcounting for `f_executable` in `_PyInterpreterFrame` (#123924 ) Use a `_PyStackRef` and defer the reference to `f_executable` when possible. This avoids some reference count contention in the common case of executing the same code object from multiple threads concurrently in the free-threaded build.	2024-09-12 12:37:06 -04:00
Mark Shannon	4ed7d1d6ac	GH-123996: Explicitly mark 'self_or_null' as an array of size 1 to ensure that it is kept in memory for calls (GH-124003)	2024-09-12 15:32:45 +01:00
Victor Stinner	f1a0d96f41	gh-123091: Use _Py_IsImmortalLoose() (#123511 ) Use _Py_IsImmortalLoose() in bytesobject.c, typeobject.c and ceval.c.	2024-09-02 14:25:19 +02:00
Mark Shannon	54a05a4600	GH-123232: Factor BINARY_SLICE and STORE_SLICE to handle stats properly for tier 2. (GH-123381)	2024-08-27 10:49:39 +01:00
Kirill Podoprigora	67f2c84bff	gh-123205: `Python/bytecodes.c`: Fix compiler warning (#123206 ) Fix MSVC warning "conversion from '__int64' to 'int'"	2024-08-23 15:35:25 -04:00
Mark Shannon	0b0f7befad	GH-123232: Fix "not specialized" stats (GH-123236)	2024-08-23 10:46:03 +01:00
Mark Shannon	5d3201fe3f	GH-123040: Specialize shadowed `LOAD_ATTR`. (GH-123219)	2024-08-23 10:22:35 +01:00
Donghee Na	297f2e093e	gh-123083: Fix a potential use-after-free in ``STORE_ATTR_WITH_HINT`` (gh-123092)	2024-08-22 23:49:09 +09:00
Mark Shannon	a3d8c0542e	GH-123197: Only count an instruction as deferred if it hasn't deopted first. (GH-123222) Only count an instruction as deferred if hasn't deopted first.	2024-08-22 14:17:10 +01:00
Mark Shannon	a4fd7aa4a6	GH-115776: Allow any fixed sized object to have inline values (GH-123192)	2024-08-21 15:52:04 +01:00
Mark Shannon	7b26c4d1e3	GH-123197: Increment correct stat for CALL_KW (GH-123200)	2024-08-21 12:52:28 +01:00
Mark Shannon	1eba8bae92	GH-123185: Check for `NULL` after calling `_PyEvalFramePushAndInit` (GH-123194)	2024-08-21 12:44:56 +01:00
Mark Shannon	bb1d30336e	GH-118093: Make `CALL_ALLOC_AND_ENTER_INIT` suitable for tier 2. (GH-123140) * Convert CALL_ALLOC_AND_ENTER_INIT to micro-ops such that tier 2 supports it * Allow inexact arguments for CALL_ALLOC_AND_ENTER_INIT.	2024-08-20 16:52:58 +01:00
Mark Shannon	c13e7d98fb	GH-118093: Specialize `CALL_KW` (GH-123006)	2024-08-16 17:11:24 +01:00
Brandt Bucher	f84754b705	GH-118093: Turn some DEOPT_IFs into EXIT_IFs (GH-122998)	2024-08-14 07:54:42 -07:00
Mark Shannon	eec7bdaf01	GH-120024: Remove `CHECK_EVAL_BREAKER` macro. (GH-122968) * Factor some instructions into micro-ops to isolate CHECK_EVAL_BREAKER for escape analysis * Eliminate CHECK_EVAL_BREAKER macro	2024-08-14 12:04:05 +01:00
Brandt Bucher	9621a7d017	GH-118093: Handle some polymorphism before requiring progress in tier two (GH-122843)	2024-08-12 12:39:31 -07:00
Sam Gross	ab094d1b2b	gh-117139: Replace _PyList_FromArraySteal with stack ref variant (#122830 ) This replaces `_PyList_FromArraySteal` with `_PyList_FromStackRefSteal`. It's functionally equivalent, but takes a `_PyStackRef` array instead of an array of `PyObject` pointers. Co-authored-by: Ken Jin <kenjin@python.org>	2024-08-12 14:49:49 -04:00

1 2 3 4 5 ...

349 Commits