reader: optimize BackoffTimer using float #156

ploxiln · 2016-06-04T05:06:32Z

my version of #155

performance tested with @virtuald's rawperf.py on a plugged-in 2013 macbook pro retina 13"

master: about 16s
optim_backofftimer: about 12s

Maybe a refactor - to not call these methods so much - is what the library really needs, but this was a fun easy exercise 😁

mreiferson · 2016-06-04T14:39:39Z

I still think this is silly. The culprit is clearly (at least) this line, https://github.com/nsqio/pynsq/blob/master/nsq/reader.py#L358. If that performs no decimal calculations, how much does that buy us?

FWIW this PR might be solving exactly that, since it caches the value, but do we need all the other noise?

ploxiln · 2016-06-04T18:10:05Z

I think using Decimal for this purpose is silly. This code doesn't really care what kind of float. Isn't it a simplification to not use Decimal? To not convert float -> string -> Decimal -> float?
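To make the round trip concrete, here is a minimal sketch (not the pynsq code; the function names and sample values are hypothetical) that computes an interval through Decimal and through plain floats, with a rough `timeit` comparison. Timings will vary by machine and Python version, so treat the numbers as indicative only.

```python
import timeit
from decimal import Decimal


def interval_via_decimal(min_interval, short_interval, long_interval):
    # float -> str -> Decimal for each input, then Decimal -> float for the result
    total = (Decimal(str(min_interval))
             + Decimal(str(short_interval))
             + Decimal(str(long_interval)))
    return float(total)


def interval_via_float(min_interval, short_interval, long_interval):
    # Plain float arithmetic, no conversions
    return min_interval + short_interval + long_interval


def compare(number=10000):
    # Returns (decimal_seconds, float_seconds) for `number` calls each
    args = (0.1, 3.0, 0.36)  # hypothetical sample values
    t_decimal = timeit.timeit(lambda: interval_via_decimal(*args), number=number)
    t_float = timeit.timeit(lambda: interval_via_float(*args), number=number)
    return t_decimal, t_float


if __name__ == "__main__":
    t_decimal, t_float = compare()
    print("Decimal round trip: %.4fs, float: %.4fs" % (t_decimal, t_float))
```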

It just happens that the numbers used for short_length and long_length have both 2 and 5 as prime factors instead of just 2, so base 10 avoids rounding errors in short_unit and long_unit that compound and leave you an iota off from 0 after a test makes the same number of failure() and success() calls. (The test also uses min_interval=0.1, another contributor of a factor of 5 in the denominator.)
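A small sketch of that effect, assuming hypothetical parameters (min_interval=0.1, max_interval=120, ratio=0.25, short_length=10) and a made-up helper `round_trip`; it is not taken from the pynsq tests. It applies the same number of increments and decrements with a float unit and with a Decimal unit and compares what is left over.

```python
from decimal import Decimal


def round_trip(unit, zero, steps):
    # Apply `steps` failure()-style increments, then `steps` success()-style
    # decrements, clamping at zero the way a backoff timer would.
    interval = zero
    for _ in range(steps):
        interval += unit
    for _ in range(steps):
        interval = max(interval - unit, zero)
    return interval


# Hypothetical parameters, chosen only for illustration
float_unit = (120 - 0.1) * 0.25 / 10
decimal_unit = (Decimal("120") - Decimal("0.1")) * Decimal("0.25") / Decimal("10")

float_residue = round_trip(float_unit, 0.0, 7)
decimal_residue = round_trip(decimal_unit, Decimal(0), 7)

print(repr(float_residue))    # may be a tiny non-zero value, depending on rounding
print(repr(decimal_residue))  # exactly zero, since every step is exact in base 10
```

A float-based test would need a tolerance (or rounding) when comparing against 0, whereas the Decimal version can compare exactly.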

I'll run the simple benchmark with just the get_interval() caching to see. But @virtuald's profiling suggested that success() was also called a lot more often than we would expect (not just when in backoff state).

mreiferson · 2016-06-04T18:26:26Z

I don't disagree about decimal/float, but we should first make sure it's called only when necessary and then optimize it if necessary.

This code is ancient Bitly code pre-dating pynsq.

ploxiln · 2016-06-04T20:36:35Z

Putting everything back, and then caching just get_interval(), benchmark run-time was about 14.5 seconds. Adding short-circuiting of success() brought that down to 12 seconds. So right now, success() is actually more significant because it has more Decimal operations. (I think this reader benchmark is constantly on the edge of max_in_flight, which I'm guessing is related to the "continue" event.)
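For readers following along, the two changes can be sketched roughly like this. This is an illustrative reconstruction, not the merged code: the class name `CachedBackoffTimer`, the `_update_interval` helper, and the default parameter values are assumptions, and it assumes a non-negative min_interval.

```python
from decimal import Decimal


class CachedBackoffTimer(object):
    """Hypothetical sketch of a backoff timer with a cached interval."""

    def __init__(self, min_interval, max_interval, ratio=0.25,
                 short_length=10, long_length=250):
        self.min_interval = Decimal(str(min_interval))
        self.max_interval = Decimal(str(max_interval))
        span = self.max_interval - self.min_interval
        self.max_short_timer = span * Decimal(str(ratio))
        self.max_long_timer = span * (1 - Decimal(str(ratio)))
        self.short_unit = self.max_short_timer / Decimal(str(short_length))
        self.long_unit = self.max_long_timer / Decimal(str(long_length))
        self.short_interval = Decimal(0)
        self.long_interval = Decimal(0)
        self._update_interval()

    def _update_interval(self):
        # Do the Decimal sum and the Decimal -> float conversion once per change
        self.interval = float(
            self.min_interval + self.short_interval + self.long_interval)

    def success(self):
        # Short-circuit: with a non-negative min_interval, a zero interval
        # means both timers are already at zero, so there is nothing to do
        if self.interval == 0.0:
            return
        self.short_interval = max(self.short_interval - self.short_unit, Decimal(0))
        self.long_interval = max(self.long_interval - self.long_unit, Decimal(0))
        self._update_interval()

    def failure(self):
        self.short_interval = min(
            self.short_interval + self.short_unit, self.max_short_timer)
        self.long_interval = min(
            self.long_interval + self.long_unit, self.max_long_timer)
        self._update_interval()

    def get_interval(self):
        # Cheap read of the cached float; no Decimal work here
        return self.interval
```

With min_interval=0, a reader that is not in backoff hits the early return in success() on every call, which would be consistent with success() dominating the benchmark before the short-circuit was added.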

I also think it's weird that the test uses min_interval=0.1, but actual pynsq reader always uses min_interval=0, and that's actually a significant number to a branch in the reader code. So if I were to do the "big cleanup" option for this code ... I would also get rid of min_interval :)

ploxiln · 2016-06-06T05:47:39Z

let me know if/when to squash

mreiferson · 2016-06-06T05:52:18Z

🔨

cache the calculation+conversion for get_interval(); if get_interval() is 0, then success() can be short-circuited

virtuald · 2016-06-06T10:19:23Z

I'll run this locally later today and see how it shows up in the profiling output.

mreiferson · 2016-06-06T14:34:56Z

thanks!

mreiferson added the feature label Jun 4, 2016

mreiferson changed the title ~~optimize BackoffTimer using float~~ reader: optimize BackoffTimer using float Jun 4, 2016

ploxiln force-pushed the optim_backofftimer branch from b02ada4 to d739920 June 4, 2016 20:35

jehiah added performance and removed feature labels Jun 5, 2016

mreiferson mentioned this pull request Jun 6, 2016

reader: don't use Decimal class to calculate backoff #155

Closed

optimize BackoffTimer by caching and short-circuiting

f632b56

cache the calculation+conversion for get_interval(); if get_interval() is 0, then success() can be short-circuited

ploxiln force-pushed the optim_backofftimer branch from d739920 to f632b56 June 6, 2016 06:06

mreiferson merged commit 4cdb4eb into nsqio:master Jun 6, 2016

ploxiln deleted the optim_backofftimer branch May 1, 2017 18:08
