gh-101444: Optimize bytearray slice assignment for bytes-like object #101445

msoxzw · 2023-01-31T01:18:49Z

In addition to bytearray, bytes-like object supporting buffer protocol could also bypass unnecessary data copies, thereby giving 3 ~ 4 times speedup.

Issue: bytearray slice assignment for bytes-like object is much slower than that for bytearray #101444

bytes is equivalent to immutable bytearray, and therefore, we could safely avoid unnecessary copy, thereby giving 300% speedup or so.

In addition to bytearray, bytes-like object supporting buffer protocol could bypass unnecessary data copies, thereby giving 3 times speedup or so.

msoxzw · 2023-02-08T03:39:49Z

I leveraged GitHub actions to measure optimization results on various systems. Only differences between bytearray and bytes, memoryview are significant, since benchmarks might not be run on the identical hardware.
python -m timeit -s "a=bytearray(4096);b=b'x'*1024;" "a[:1024]=b"
python -m timeit -s "a=bytearray(4096);b=memoryview(b'x'*1024);" "a[:1024]=b"
python -m timeit -s "a=bytearray(4096);b=bytearray(b'x'*1024);" "a[:1024]=b"

Before: https://github.com/msoxzw/cpython/actions/runs/4120204201

System	bytes	memoryview	bytearray
Windows (x86)	392 ns	389 ns	111 ns
Windows (x64)	409 ns	420 ns	112 ns
macOS	397 ns	358 ns	90.2 ns
Ubuntu	292 ns	284 ns	91.1 ns

After: https://github.com/msoxzw/cpython/actions/runs/4120296060

System	bytes	memoryview	bytearray
Windows (x86)	150 ns	163 ns	168 ns
Windows (x64)	83.3 ns	86.2 ns	81.9 ns
macOS	93.3 ns	99.7 ns	98.7 ns
Ubuntu	88.8 ns	76.2 ns	71.5 ns

Therefore, this PR makes bytearray slice assignment for bytes-like object run as fast as that for bytearray.

msoxzw · 2023-02-22T01:48:55Z

I manage to benchmark only on Windows on the same machine through GitHub actions. Buffer protocol would incur marginal ~10% overhead, if bytearray is regarded as buffer object.
https://github.com/msoxzw/cpython/actions/runs/4238205216/jobs/7365033501

time (ns)	bytes	memoryview	bytearray
before	417	405	113
after	120	130	125

If data copies are bypassed only for bytes objects, such performance penalty would be avoided accordingly.
https://github.com/msoxzw/cpython/actions/runs/4238204896/jobs/7365028180

time (ns)	bytes	memoryview	bytearray
before	331	332	93.4
after	89.9	343	91.8

So is it acceptable to such performance overhead?

serhiy-storchaka

How does it work for ba[:] = memoryview(b'abcd')[::2]?

msoxzw · 2024-02-14T20:35:54Z

How does it work for ba[:] = memoryview(b'abcd')[::2]?

Thanks for such wonderful review.

It would work like byte or bytearray concatenation: bytearray() + memoryview(b'abcd')[::2]

This behavior is dictated by PyBUF_SIMPLE flag in PyObject_GetBuffer function. Nevertheless, it is probable to preserve original behavior, and thus maintain API compatibility.

For memoryview slice, such as `ba[:] = memoryview(b'abcd')[::2]`

serhiy-storchaka

Ignoring arbitrary errors in PyObject_GetBuffer() is not good.

How does it work for ba[::-1] = memoryview(ba)? For ba[:0] = memoryview(ba), ba[:2] = memoryview(ba)[-2:]?

Please add tests for all cases in which your intermediate code failed.

Optimize bytearray slice assignment for bytes

3e2323a

bytes is equivalent to immutable bytearray, and therefore, we could safely avoid unnecessary copy, thereby giving 300% speedup or so.

bedevere-bot added the awaiting review label Jan 31, 2023

bedevere-bot mentioned this pull request Jan 31, 2023

bytearray slice assignment for bytes-like object is much slower than that for bytearray #101444

Open

Optimize bytearray slice assignment

a5d67c4

In addition to bytearray, bytes-like object supporting buffer protocol could bypass unnecessary data copies, thereby giving 3 times speedup or so.

msoxzw changed the title ~~gh-101444: Optimize bytearray slice assignment for bytes~~ gh-101444: Optimize bytearray slice assignment for bytes-like object Feb 6, 2023

arhadthedev added performance Performance or resource usage interpreter-core (Objects, Python, Grammar, and Parser dirs) labels Feb 6, 2023

📜🤖 Added by blurb_it.

de5250c

serhiy-storchaka reviewed Feb 2, 2024

View reviewed changes

msoxzw added 2 commits February 14, 2024 20:38

Preserve original bytearray slice assignment behavior

1ae225b

For memoryview slice, such as `ba[:] = memoryview(b'abcd')[::2]`

Merge branch 'main' into bytearray-slice-assignment

37bc052

serhiy-storchaka reviewed Feb 16, 2024

View reviewed changes

May	JUN	Jul
	13
2024	2025	2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-101444: Optimize bytearray slice assignment for bytes-like object #101445

gh-101444: Optimize bytearray slice assignment for bytes-like object #101445

Uh oh!

msoxzw commented Jan 31, 2023 •

edited

Loading

Uh oh!

msoxzw commented Feb 8, 2023

Uh oh!

msoxzw commented Feb 22, 2023

Uh oh!

serhiy-storchaka left a comment

Uh oh!

msoxzw commented Feb 14, 2024

Uh oh!

serhiy-storchaka left a comment

Uh oh!

Uh oh!

Uh oh!

gh-101444: Optimize bytearray slice assignment for bytes-like object #101445

Are you sure you want to change the base?

gh-101444: Optimize bytearray slice assignment for bytes-like object #101445

Uh oh!

Conversation

msoxzw commented Jan 31, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

msoxzw commented Feb 8, 2023

Uh oh!

msoxzw commented Feb 22, 2023

Uh oh!

serhiy-storchaka left a comment

Choose a reason for hiding this comment

Uh oh!

msoxzw commented Feb 14, 2024

Uh oh!

serhiy-storchaka left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

msoxzw commented Jan 31, 2023 •

edited

Loading