Performance Summary
The performance results of benchmarking the Isaac ROS packages on both Jetson and x86_64 platforms are below:
Node
|
Input Size
|
AGX Orin
|
Orin NX
|
Orin Nano 8GB
|
x86_64 w/ RTX 4060 Ti
|
x86_64 w/ RTX 4090
|
---|---|---|---|---|---|---|
720p
|
7.3 ms @ 30Hz
|
12 ms @ 30Hz
|
18 ms @ 30Hz
|
2.4 ms @ 30Hz
|
2.1 ms @ 30Hz
|
|
576p
|
1.4 ms @ 30Hz
|
1.8 ms @ 30Hz
|
2.6 ms @ 30Hz
|
0.50 ms @ 30Hz
|
0.48 ms @ 30Hz
|
|
576p
|
76 ms @ 30Hz
|
92 ms @ 30Hz
|
–
|
35 ms @ 30Hz
|
35 ms @ 30Hz
|
|
720p
|
690 ms @ 30Hz
|
–
|
–
|
–
|
170 ms @ 30Hz
|
|
576p
|
13 ms @ 30Hz
|
27 ms @ 30Hz
|
–
|
5.5 ms @ 30Hz
|
2.4 ms @ 30Hz
|
|
288p
|
5.9 ms @ 30Hz
|
10 ms @ 30Hz
|
–
|
2.4 ms @ 30Hz
|
1.7 ms @ 30Hz
|
|
1080p
|
7.5 ms @ 30Hz
|
15 ms @ 30Hz
|
22 ms @ 30Hz
|
3.4 ms @ 30Hz
|
1.8 ms @ 30Hz
|
|
1080p
|
2.5 ms @ 30Hz
|
3.5 ms @ 30Hz
|
5.2 ms @ 30Hz
|
0.88 ms @ 30Hz
|
0.66 ms @ 30Hz
|
|
DOPE
|
VGA
|
24 ms @ 30Hz
|
56 ms @ 30Hz
|
82 ms @ 30Hz
|
13 ms @ 30Hz
|
5.1 ms @ 30Hz
|
DOPE
|
VGA
|
23 ms @ 30Hz
|
540 ms @ 30Hz
|
790 ms @ 30Hz
|
12 ms @ 30Hz
|
4.6 ms @ 30Hz
|
PeopleSemSegNet
|
544p
|
4.1 ms @ 30Hz
|
6.1 ms @ 30Hz
|
7.0 ms @ 30Hz
|
2.9 ms @ 30Hz
|
3.0 ms @ 30Hz
|
PeopleSemSegNet
|
544p
|
4.8 ms @ 30Hz
|
6.5 ms @ 30Hz
|
–
|
2.2 ms @ 30Hz
|
1.9 ms @ 30Hz
|
VGA
|
12 ms @ 30Hz
|
12 ms @ 30Hz
|
–
|
6.6 ms @ 30Hz
|
6.4 ms @ 30Hz
|
|
~50 sq. m
|
57 ms @ 30Hz
|
130 ms @ 30Hz
|
190 ms @ 30Hz
|
21 ms @ 30Hz
|
12 ms @ 30Hz
|
|
1080p
|
8.1 ms @ 30Hz
|
–
|
–
|
3.8 ms @ 30Hz
|
4.3 ms @ 30Hz
|
|
I-frame Support
|
1080p
|
12 ms @ 30Hz
|
–
|
–
|
3.3 ms @ 30Hz
|
3.2 ms @ 30Hz
|
P-frame Support
|
1080p
|
9.1 ms @ 30Hz
|
–
|
–
|
2.3 ms @ 30Hz
|
2.1 ms @ 30Hz
|
–
|
77.1 ms
|
151 ms
|
91.2 ms
|
85.3 ms
|
64.2 ms
|
Graph
|
Input Size
|
AGX Orin
|
Orin NX
|
Orin Nano 8GB
|
x86_64 w/ RTX 4060 Ti
|
x86_64 w/ RTX 4090
|
---|---|---|---|---|---|---|
720p
|
9.5 ms @ 30Hz
|
15 ms @ 30Hz
|
21 ms @ 30Hz
|
3.4 ms @ 30Hz
|
2.9 ms @ 30Hz
|
|
576p
|
77 ms @ 30Hz
|
99 ms @ 30Hz
|
100 ms @ 30Hz
|
32 ms @ 30Hz
|
38 ms @ 30Hz
|
|
VGA
|
4.8 ms @ 30Hz
|
4.9 ms @ 30Hz
|
7.4 ms @ 30Hz
|
23 ms @ 30Hz
|
20 ms @ 30Hz
|
|
VGA
|
42 ms @ 30Hz
|
76 ms @ 30Hz
|
–
|
24 ms @ 30Hz
|
14 ms @ 30Hz
|
|
576p
|
5.4 ms @ 30Hz
|
36 ms @ 30Hz
|
–
|
3.7 ms @ 30Hz
|
5.7 ms @ 30Hz
|
|
288p
|
5.9 ms @ 30Hz
|
8.5 ms @ 30Hz
|
–
|
6.1 ms @ 30Hz
|
5.6 ms @ 30Hz
|
|
1080p
|
12 ms @ 30Hz
|
20 ms @ 30Hz
|
28 ms @ 30Hz
|
2.5 ms @ 30Hz
|
2.7 ms @ 30Hz
|
|
544p
|
20 ms @ 30Hz
|
26 ms @ 30Hz
|
36 ms @ 30Hz
|
10 ms @ 30Hz
|
10 ms @ 30Hz
|
|
SyntheticaDETR
|
720p
|
24 ms @ 30Hz
|
41 ms @ 30Hz
|
61 ms @ 30Hz
|
8.7 ms @ 30Hz
|
6.3 ms @ 30Hz
|
PeopleSemSegNet
|
544p
|
19 ms @ 30Hz
|
20 ms @ 30Hz
|
23 ms @ 30Hz
|
11 ms @ 30Hz
|
9.3 ms @ 30Hz
|
Full SAM
|
720p
|
470 ms @ 30Hz
|
–
|
–
|
–
|
79 ms @ 30Hz
|
Mobile SAM
|
720p
|
880 ms @ 30Hz
|
1500 ms @ 30Hz
|
360 ms @ 30Hz
|
62 ms @ 30Hz
|
27 ms @ 30Hz
|
Live Graph
|
Input Size
|
Nova Carter
|
---|---|---|
4 Hawk Cameras
|
1200p
|
0 dropped frames (avg)
|
4 Hawk Cameras
|
1200p
|
|
1200p
|
||
1200p
|
Methodology
The performance numbers provided above are measured using Isaac ROS Benchmark. Configuration files for reproducing results are provided in the launch scripts for each benchmark here.
Frames per second (fps) are measured using the maximum sustained framerate for the accelerated computational pipeline on each listed platforms. Performance measured includes input node → graph of node(s) under performance test → output node where the publishing rate of the input node is auto-tuned to discover the peak throughput dropping <5% of the frames. Average fps computed over 5 runs, discarding minimum and maximum runs; latency measured at 30hz publishing rate matching sensor capture rate.