test: add a stress test for the https outcalls feature #4449

Open
mihailjianu1 wants to merge 13 commits into master

Conversation

@mihailjianu1 (Contributor) commented Mar 20, 2025

This test adds an update method to the existing proxy_canister that performs a configurable number (count) of concurrent outcalls.

However, since the number of concurrent canister messages is limited to 500, the requests have to be split into batches.

All requests are lightweight in terms of both request size and response size.
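
A minimal sketch of that batching logic, assuming the ic-cdk management-canister API; the function name, cycle amount, and constants below are illustrative and not the actual proxy_canister code:

use ic_cdk::api::management_canister::http_request::{
    http_request, CanisterHttpRequestArgument,
};

// Illustrative only: split `count` outcalls into batches so that at most
// MAX_CONCURRENT calls (the 500 canister-message limit) are in flight at once.
const MAX_CONCURRENT: u64 = 500;
const CYCLES_PER_CALL: u128 = 1_000_000_000; // placeholder cycle amount

async fn run_outcalls(count: u64, request: CanisterHttpRequestArgument) {
    let mut remaining = count;
    while remaining > 0 {
        let batch = remaining.min(MAX_CONCURRENT);
        let calls = (0..batch).map(|_| http_request(request.clone(), CYCLES_PER_CALL));
        // Wait for the whole batch to complete before starting the next one.
        futures::future::join_all(calls).await;
        remaining -= batch;
    }
}

A consequence of waiting on each full batch is that the in-flight window drains to zero at the tail of every batch, which is what future-work item 2 further below addresses.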

The test sets up two testnets, with 13 and 40 nodes respectively. For each of them it sends 200, 500, and 1000 concurrent requests and measures the average QPS for each level (averaged over 3 experiments).
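
A minimal sketch of how the per-level measurement could be driven; run_experiment is a hypothetical stand-in for the actual system-test helper and is assumed to submit n concurrent outcalls through the proxy canister and return the elapsed wall-clock time:

use std::time::Duration;

const EXPERIMENTS: u32 = 3;

// Hypothetical stand-in for the real test logic.
fn run_experiment(_n: u64) -> Duration {
    unimplemented!("submit n concurrent outcalls and measure the elapsed time")
}

fn average_qps(n: u64) -> f64 {
    let total: f64 = (0..EXPERIMENTS)
        .map(|_| n as f64 / run_experiment(n).as_secs_f64())
        .sum();
    total / f64::from(EXPERIMENTS)
}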

Results are as follows:

QPS:
node# \ concurrency_level | 200 | 500 | 1000
13:                       | 89  | 155 | 144
40:                       | 52  | 70  | 68

Interpretation:

  • The canister measures the time of each http_request call to the management canister. This includes ingress message processing, consensus, block making, and so on (a rough sketch of the timing follows this list).
  • The 40-node subnet is slower mainly because consensus takes longer and there is a higher chance that some adapters are slightly slower.
  • Not fully saturating the 500-message concurrency limit yields a worse QPS.
  • In the current setup, the QPS is also correlated with the response delay of the target server.
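
A minimal sketch of that per-call timing, reusing the illustrative names from the batching sketch above (again, not the actual canister code). ic_cdk::api::time() returns nanoseconds since the Unix epoch and advances across the await, because the reply is delivered in a later round:

async fn timed_outcall(request: CanisterHttpRequestArgument) -> u64 {
    let start = ic_cdk::api::time();
    let _response = http_request(request, CYCLES_PER_CALL).await;
    // Elapsed nanoseconds as observed by the canister; this spans the full
    // round trip through the management canister, consensus, and the adapters.
    ic_cdk::api::time() - start
}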

Colocating nodes does not seem to affect the average QPS much (it is sometimes ~10% higher or lower, with no clear indication that it is significantly better or worse), as the main bottleneck here is the 500-message limit on concurrent canister messages.

Future attempts at stressing the feature may include:

  1. Installing multiple proxy canisters on the same subnet to bypass the 500-message limit.
  2. Fully saturating the bottleneck by continuously enqueueing new requests as soon as earlier ones complete, instead of waiting for all 500 to return (a rough sketch follows below).
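
A minimal sketch of option 2, again with the illustrative names from the earlier sketches and not part of this PR: a new request is started every time one completes, so the outgoing-call window stays full instead of draining between batches:

use futures::stream::{FuturesUnordered, StreamExt};

async fn saturate(total: u64, request: CanisterHttpRequestArgument) {
    let mut in_flight = FuturesUnordered::new();
    let mut started = 0u64;
    // Fill the window up to the 500-message limit.
    while started < total.min(MAX_CONCURRENT) {
        in_flight.push(http_request(request.clone(), CYCLES_PER_CALL));
        started += 1;
    }
    // Each completion immediately frees a slot for the next request.
    while let Some(_result) = in_flight.next().await {
        if started < total {
            in_flight.push(http_request(request.clone(), CYCLES_PER_CALL));
            started += 1;
        }
    }
}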

@github-actions github-actions bot added the test label Mar 20, 2025
@mihailjianu1 mihailjianu1 marked this pull request as ready for review March 28, 2025 10:33
@mihailjianu1 mihailjianu1 requested a review from a team as a code owner March 28, 2025 10:33
@Sawchord (Contributor) left a comment

Awesome! Thanks!

@mihailjianu1 mihailjianu1 requested review from gbrel and a team March 28, 2025 13:48
@kpop-dfinity (Contributor) commented Mar 28, 2025

Great stuff!

Non-blocking comment: I haven't looked at the code yet but two more things to consider would be:

  1. Run the test on the performance cluster to simulate the production environment more closely (see for example). I'm not entirely sure whether this will work, because your setup needs 53+ nodes and I don't know how big the performance cluster is.
  2. Impose some artificial network conditions (example) to see how the QPS correlates with the latency between nodes. If you want to be even fancier, you could use ProductionSubnetTopology::UZR34 to simulate a subnet that carries heavy HTTPS outcalls traffic.

url: String,
logger: &Logger,
concurrent_requests: u64,
) -> Result<u64, anyhow::Error> {
Contributor

maybe it would be easier to use if we return std::time::Duration here?

Suggested change
) -> Result<u64, anyhow::Error> {
) -> Result<Duration, anyhow::Error> {
