Minor performance tweaks #158

Merged: 1 commit, merged on Jun 10, 2016
Conversation

@virtuald (Contributor) commented Jun 6, 2016

The first one is pretty self-explanatory (maybe it needs another PR?) -- only call time.time() once; the extra function-call overhead, small as it is, isn't necessary.
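
A minimal sketch of the idea (illustrative only; the attribute names here are stand-ins, not necessarily the actual pynsq fields):

```python
import time

# Before: one time.time() call per timestamp being refreshed.
def on_data_before(conn):
    conn.last_recv_timestamp = time.time()
    conn.last_msg_timestamp = time.time()

# After: call time.time() once and reuse the value.
def on_data_after(conn):
    now = time.time()
    conn.last_recv_timestamp = now
    conn.last_msg_timestamp = now
```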

The second one we can debate: in the reader, it now immediately calls _start_read() instead of going through the IOLoop to call it. This results in a ~1s improvement in my rawperf script. I think I understand why it was done this way originally -- presumably to avoid starving the IOLoop if messages happen to come in at just the right rate that the socket buffer can keep feeding them, since the read functions would then call each other continuously in a loop without yielding to the IOLoop.

But... I wonder if that can actually happen?

If it can happen... then perhaps it should only use add_callback every once in a while, or when it detects a cycle... or some other probably terrible thing? At the very least, we should add a note explaining why the code uses add_callback here.
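
For what it's worth, the "only use add_callback every once in a while" idea could look roughly like this (purely a sketch; the counter, threshold, and method names are hypothetical and not part of this PR):

```python
class ReaderSketch(object):
    # Hypothetical: after N consecutive direct reads, yield to the IOLoop
    # once so a socket that always has buffered data can't starve other
    # handlers on the same loop.
    MAX_DIRECT_READS = 128

    def __init__(self, io_loop):
        self.io_loop = io_loop
        self._direct_reads = 0

    def _continue_reading(self):
        self._direct_reads += 1
        if self._direct_reads >= self.MAX_DIRECT_READS:
            self._direct_reads = 0
            self.io_loop.add_callback(self._start_read)  # break the cycle
        else:
            self._start_read()  # fast path: issue the next read immediately

    def _start_read(self):
        # would call self.stream.read_bytes(...) in the real reader
        pass
```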

@mreiferson (Member)

@virtuald same question here - any statistically significant improvement?

I worry about chasing micro-optimizations...

@mreiferson (Member)

The above question was w/r/t the time.time change, sorry.

re: the deferred read, I think the change is safe because _read_bytes just calls self.stream.read_bytes, which defers if necessary.

👍

@virtuald (Contributor, Author) commented Jun 6, 2016

The time.time change isn't significant.

Heh, well, it's clear that whoever wrote that code originally really wanted to make sure that it defers to the IOLoop, since the test checks for it. I'll fix the test later today.

@@ -277,7 +277,7 @@ def _read_body(self, data):
            self.trigger(event.DATA, conn=self, data=data)
        except Exception:
            logger.exception('uncaught exception in data event')
        self.io_loop.add_callback(self._start_read)
Member

I have several spots where I use multiple independent readers on the same IOLoop, so yielding is important to me. It's also important because calling things directly can lead to infinite call stacks. I think that makes me 👎 for this change.
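
A contrived way to see the call-stack concern (illustrative only, not pynsq code): chaining reads with direct calls adds a stack frame per buffered message, while queueing the next step and returning, which is what add_callback gives you, keeps the stack flat.

```python
MESSAGES = 5000

def chain_direct(i=0):
    # Direct chaining: each message is handled inside the previous frame,
    # so a long enough burst eventually raises RecursionError.
    if i < MESSAGES:
        chain_direct(i + 1)

def chain_queued():
    # Queued chaining: each step returns before the next one runs,
    # analogous to io_loop.add_callback(self._start_read) between messages.
    queue = [0]
    while queue:
        i = queue.pop()
        if i < MESSAGES:
            queue.append(i + 1)
```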

Contributor Author

Yeah, that seems reasonable enough. I wonder if there's a more efficient way of doing this, then?

Member

I'm pretty sure it won't "block"; it'll either immediately handle unprocessed (but already read) bytes or it will defer and set up a callback.

http://www.tornadoweb.org/en/stable/_modules/tornado/iostream.html#BaseIOStream.read_bytes

This just skips an additional deferment.
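
Roughly, the two control-flow paths being compared (a sketch, not the exact pynsq code; the stub methods are stand-ins for the real ones):

```python
class ConnSketch(object):
    def __init__(self, io_loop, stream):
        self.io_loop = io_loop
        self.stream = stream

    def _handle_frame(self, data):
        pass  # stand-in for triggering event.DATA

    def _start_read(self):
        # stand-in for self.stream.read_bytes(...); read_bytes itself decides
        # whether to fire its callback now (bytes already buffered) or wait.
        pass

    # Before this change: finish the frame, then ask the IOLoop to call
    # _start_read on its next iteration. Two hops before the read is issued.
    def _read_body_deferred(self, data):
        self._handle_frame(data)
        self.io_loop.add_callback(self._start_read)

    # After this change: issue the next read immediately; deferral still
    # happens inside stream.read_bytes() when nothing is buffered.
    def _read_body_direct(self, data):
        self._handle_frame(data)
        self._start_read()
```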

Contributor Author

Right -- the question I have is whether enough data could be fed in at just the right rhythm that it never defers. That's something I'm a bit unsure about.

Member

From my scanning of the tornado code, I think it's functionally equivalent with one less deferment.

Member

@mreiferson thanks for digging a little further. I agree, and that addresses my previous concern.

@mreiferson (Member)

Let's drop the time.time change if you don't mind.

Assuming @jehiah is amenable to the other change and we fix the failing test, this LGTM!

@virtuald (Contributor, Author)

I fixed the tests... it looks like one of the tests on Travis failed with a spurious error (connection refused to nsqd); I'm not totally sure what caused that.

@mreiferson (Member)

Restarted those individual tests; let's see if we get a 🍏

@mreiferson (Member)

LGTM

@mreiferson merged commit 55ea759 into nsqio:master on Jun 10, 2016
@mreiferson (Member)

thanks!
