
byteSliceReader compatibility with flate.Reader interface #978

Merged 1 commit into valyala:master on Feb 24, 2021

Conversation

@moredure (Contributor) commented Feb 22, 2021

Add compatibility with flate.Reader to reduce allocations of bufio.Reader structs (each backed by a default 4096-byte buffer) in WriteGunzip and WriteDeflate.

Add compatibility with flate.Reader to reduce allocations of bufio.NewReader structs backed by default size slices.
@dgrr (Contributor) commented Feb 22, 2021

This is going to increase allocation. It'd be better to use: r.b = append(r.b[:0], r.b[1:]...). Right?

@erikdubbelboer (Collaborator)

I don't think this is going to cause an allocation. The same backing array will be used; only the data pointer in the slice header will be advanced.

I had no idea flate.Reader did this, good catch!

@dgrr (Contributor) commented Feb 22, 2021

Right! Though the content was appended at the end of the slice.

@moredure moredure changed the title Update compress.go byteSliceReader compatibility with flate.Reader interface Feb 24, 2021
@moredure (Contributor, Author) commented Feb 24, 2021

Also, it looks like

type slicer []byte

func (z *slicer) Read(p []byte) (int, error) {
	if len(*z) == 0 {
		return 0, io.EOF
	}
	n := copy(p, *z)
	*z = (*z)[n:]
	return n, nil
}

func (z *slicer) ReadByte() (byte, error) {
	if len(*z) == 0 {
		return 0, io.EOF
	}
	n := (*z)[0]
	*z = (*z)[1:]
	return n, nil
}

Should perform a few nanoseconds better than a struct:

func BenchmarkBytesReadSlice(b *testing.B) {
	zb := make([]byte, 1024)
	z := bytes.Repeat([]byte("a"), 1024*1024)

	// Drain 1 MiB one byte at a time through each flate.Reader implementation.
	b.Run("ReadByteStruct", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := &byteSliceReader{z}
			ReadByte(r)
		}
	})
	b.Run("ReadByteSlice", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := slicer(z)
			ReadByte(&r)
		}
	})
	b.Run("BytesBufferReadSlice", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := bytes.NewReader(z)
			ReadByte(r)
		}
	})
	// Same sub-benchmark name as above; go test reports it as
	// BytesBufferReadSlice#01 in the output below.
	b.Run("BytesBufferReadSlice", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := &byteSliceReader2{z, 0}
			ReadByte(r)
		}
	})

	// Drain the same 1 MiB through Read using a 1 KiB scratch buffer.
	b.Run("ReadStruct", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := &byteSliceReader{z}
			Read(r, zb)
		}
	})
	b.Run("ReadSlice", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := slicer(z)
			Read(&r, zb)
		}
	})
	b.Run("BytesBufferRead", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := bytes.NewReader(z)
			Read(r, zb)
		}
	})
	b.Run("BytesBufferRead2", func(b *testing.B) {
		b.ReportAllocs()
		for i := 0; i < b.N; i++ {
			r := &byteSliceReader2{z, 0}
			Read(r, zb)
		}
	})
}


type byteSliceReader2 struct {
	b []byte
	i int64
}

func (r *byteSliceReader2) Read(b []byte) (n int, err error) {
	if r.i >= int64(len(r.b)) {
		return 0, io.EOF
	}
	n = copy(b, r.b[r.i:])
	r.i += int64(n)
	return n, nil
}

func (r *byteSliceReader2) ReadByte() (byte, error) {
	if r.i >= int64(len(r.b)) {
		return 0, io.EOF
	}
	b := r.b[r.i]
	r.i++
	return b, nil
}


// Read drains r to io.EOF using zb as a scratch buffer.
func Read(r flate.Reader, zb []byte) {
	for {
		_, err := r.Read(zb)
		if err == io.EOF {
			return
		}
	}
}

// ReadByte drains r to io.EOF one byte at a time.
func ReadByte(r flate.Reader) {
	for {
		_, err := r.ReadByte()
		if err == io.EOF {
			return
		}
	}
}
cpu: Intel(R) Core(TM) i5-7600 CPU @ 3.50GHz
BenchmarkBytesReadSlice
BenchmarkBytesReadSlice/ReadByteStruct
BenchmarkBytesReadSlice/ReadByteStruct-4         	     412	   2885174 ns/op	       0 B/op	       0 allocs/op
BenchmarkBytesReadSlice/ReadByteSlice
BenchmarkBytesReadSlice/ReadByteSlice-4          	     418	   2829745 ns/op	       0 B/op	       0 allocs/op
BenchmarkBytesReadSlice/BytesBufferReadSlice
BenchmarkBytesReadSlice/BytesBufferReadSlice-4   	     526	   2255795 ns/op	       0 B/op	       0 allocs/op
BenchmarkBytesReadSlice/BytesBufferReadSlice#01
BenchmarkBytesReadSlice/BytesBufferReadSlice#01-4         	     547	   2153476 ns/op	       0 B/op	       0 allocs/op
BenchmarkBytesReadSlice/ReadStruct
BenchmarkBytesReadSlice/ReadStruct-4                      	   44881	     25967 ns/op	       0 B/op	       0 allocs/op
BenchmarkBytesReadSlice/ReadSlice
BenchmarkBytesReadSlice/ReadSlice-4                       	   45585	     25976 ns/op	       0 B/op	       0 allocs/op
BenchmarkBytesReadSlice/BytesBufferRead
BenchmarkBytesReadSlice/BytesBufferRead-4                 	   45195	     26735 ns/op	       0 B/op	       0 allocs/op
BenchmarkBytesReadSlice/BytesBufferRead2
BenchmarkBytesReadSlice/BytesBufferRead2-4                	   45588	     26490 ns/op	       0 B/op	       0 allocs/op

@dgrr (Contributor) commented Feb 24, 2021

@moredure I think a few nanoseconds doesn't make much of a difference, but cache-wise it should be better, right? I don't know exactly how Go manages memory, but having a pointer to a struct looks like it makes the computer perform two lookups (?): one for the pointer to the struct, and then another for the pointer to the slice.
If that assertion is correct, I imagine having just the slice makes more sense.

@moredure (Contributor, Author)

Anyway, you still need to take the address (&) of the slicer instance to pass it as an interface parameter (io.Reader, flate.Reader, etc.).

@moredure (Contributor, Author)

If flate readers use ReadByte frequently, maybe bytes.NewReader is better than a custom solution.

@moredure (Contributor, Author) commented Feb 24, 2021

In the end

type byteSliceReader2 struct {
	b []byte
	i int64
}

func (r *byteSliceReader2) Read(b []byte) (n int, err error) {
	if r.i >= int64(len(r.b)) {
		return 0, io.EOF
	}
	n = copy(b, r.b[r.i:])
	r.i += int64(n)
	return n, nil
}

func (r *byteSliceReader2) ReadByte() (byte, error) {
	if r.i >= int64(len(r.b)) {
		return 0, io.EOF
	}
	b := r.b[r.i]
	r.i++
	return b, nil
}

Same code as bytes.Reader but without prevRune. On a ReadByte-only workload it performs best; I'm not sure that fits flate.Reader usage.

@erikdubbelboer erikdubbelboer merged commit 0880335 into valyala:master Feb 24, 2021
@erikdubbelboer (Collaborator)

I think the current change is good enough and is in line with how the rest works. If we want to try and speed up byteSliceReader as a whole that should be a different pull request.

@moredure moredure deleted the patch-2 branch March 2, 2021 12:27