aboutsummaryrefslogtreecommitdiffstats
path: root/docs/content/infrastructure/testbed_configuration/nvidia_grc_hw_bios_cfg.md
blob: 47163a814e47d08fb1af688bf8071f1ce0da2460 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
---
bookToc: true
title: "NVidia Grace CPU"
---

# MegaRac Altra

## Linux lscpu

```
Architecture:             aarch64
  CPU op-mode(s):         64-bit
  Byte Order:             Little Endian
CPU(s):                   72
  On-line CPU(s) list:    0-71
Vendor ID:                ARM
  Model name:             Neoverse-V2
    Model:                0
    Thread(s) per core:   1
    Core(s) per socket:   72
    Socket(s):            1
    Stepping:             r0p0
    BogoMIPS:             2000.00
    Flags:                fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3
                          sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm ssbs sb paca pacg dcpodp sve2 sveaes svepmull svebitperm
                           svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh bti
Caches (sum of all):
  L1d:                    4.5 MiB (72 instances)
  L1i:                    4.5 MiB (72 instances)
  L2:                     72 MiB (72 instances)
  L3:                     114 MiB (1 instance)
NUMA:
  NUMA node(s):           1
  NUMA node0 CPU(s):      0-71
Vulnerabilities:
  Gather data sampling:   Not affected
  Itlb multihit:          Not affected
  L1tf:                   Not affected
  Mds:                    Not affected
  Meltdown:               Not affected
  Mmio stale data:        Not affected
  Reg file data sampling: Not affected
  Retbleed:               Not affected
  Spec rstack overflow:   Not affected
  Spec store bypass:      Mitigation; Speculative Store Bypass disabled via prctl
  Spectre v1:             Mitigation; __user pointer sanitization
  Spectre v2:             Not affected
  Srbds:                  Not affected
  Tsx async abort:        Not affected
```

## Linux dmidecode

```
# dmidecode 3.5
Getting SMBIOS data from sysfs.
SMBIOS 3.6.0 present.
# SMBIOS implementations newer than version 3.5.0 are not
# fully supported by this version of dmidecode.
Table at 0x3C63C70000.

Handle 0x0000, DMI type 42, 124 bytes
Management Controller Host Interface
        Host Interface Type: Network
        Device Type: <OUT OF SPEC>
        Vendor ID: 0x11:0x25:0x05:0xa2
        Protocol ID: 04 (Redfish over IP)
                Service UUID: a6bd26ba-b0f4-413a-9054-75344b6e5bf5
                Host IP Assignment Type: Static
                Host IP Address Format: IPv4
                IPv4 Address: 10.0.1.2
                IPv4 Mask: 255.255.255.0
                Redfish Service IP Discovery Type: Static
                Redfish Service IP Address Format: IPv4
                IPv4 Redfish Service Address: 10.0.1.1
                IPv4 Redfish Service Mask: 255.255.255.0
                Redfish Service Port: 443
                Redfish Service Vlan: 0
                Redfish Service Hostname: legoc1

Handle 0x0001, DMI type 0, 26 bytes
BIOS Information
        Vendor: NVIDIA
        Version:         00020003
        Release Date: 20240516
        ROM Size: 64 MB
        Characteristics:
                PCI is supported
                PNP is supported
                BIOS is upgradeable
                BIOS shadowing is allowed
                Boot from CD is supported
                Selectable boot is supported
                Serial services are supported (int 14h)
                ACPI is supported
                Targeted content distribution is supported
                UEFI is supported
        Firmware Revision: 24.3

Handle 0x0002, DMI type 7, 27 bytes
Cache Information
        Socket Designation: L1 Instruction Cache
        Configuration: Enabled, Not Socketed, Level 1
        Operational Mode: Write Back
        Location: Internal
        Installed Size: 4608 kB
        Maximum Size: 4608 kB
        Supported SRAM Types:
                Unknown
        Installed SRAM Type: Unknown
        Speed: Unknown
        Error Correction Type: Unknown
        System Type: Instruction
        Associativity: 4-way Set-associative

Handle 0x0003, DMI type 7, 27 bytes
Cache Information
        Socket Designation: L1 Data Cache
        Configuration: Enabled, Not Socketed, Level 1
        Operational Mode: Write Back
        Location: Internal
        Installed Size: 4608 kB
        Maximum Size: 4608 kB
        Supported SRAM Types:
                Unknown
        Installed SRAM Type: Unknown
        Speed: Unknown
        Error Correction Type: Unknown
        System Type: Data
        Associativity: 4-way Set-associative

Handle 0x0004, DMI type 7, 27 bytes
Cache Information
        Socket Designation: L2 Cache
        Configuration: Enabled, Not Socketed, Level 2
        Operational Mode: Write Back
        Location: Internal
        Installed Size: 72 MB
        Maximum Size: 72 MB
        Supported SRAM Types:
                Unknown
        Installed SRAM Type: Unknown
        Speed: Unknown
        Error Correction Type: Unknown
        System Type: Unified
        Associativity: 8-way Set-associative

Handle 0x0005, DMI type 7, 27 bytes
Cache Information
        Socket Designation: L3 Cache
        Configuration: Enabled, Not Socketed, Level 3
        Operational Mode: Write Back
        Location: Internal
        Installed Size: 114 MB
        Maximum Size: 114 MB
        Supported SRAM Types:
                Unknown
        Installed SRAM Type: Unknown
        Speed: Unknown
        Error Correction Type: Unknown
        System Type: Unified
        Associativity: 12-way Set-associative

Handle 0x0006, DMI type 4, 50 bytes
Processor Information
        Socket Designation: G1:0.0
        Type: Central Processor
        Family: <OUT OF SPEC>
        Manufacturer: NVIDIA
        ID: 41 02 6B 03 02 00 00 00
        Version: Grace A02
        Voltage: Unknown
        External Clock: 1000 MHz
        Max Speed: 4000 MHz
        Current Speed: 3447 MHz
        Status: Populated, Enabled
        Upgrade: None
        L1 Cache Handle: 0x0003
        L2 Cache Handle: 0x0004
        L3 Cache Handle: 0x0005
        Serial Number: 0x00000001780A01860C00000016010200
        Asset Tag: <BAD INDEX>
        Part Number: <BAD INDEX>
        Core Count: 72
        Core Enabled: 72
        Thread Count: 72
        Characteristics:
                64-bit capable
                Execute Protection
                Arm64 SoC ID

Handle 0x0007, DMI type 9, 24 bytes
System Slot Information
        Designation: PCIe Slot 1
        Type: x16 PCI Express 5 x16
        Current Usage: In Use
        Length: Long
        ID: 1
        Characteristics: None
        Bus Address: 0000:01:00.0
        Data Bus Width: 13
        Peer Devices: 0
        PCI Express Generation: 5
        Slot Physical Width: x16
        Height: Full height

Handle 0x0008, DMI type 9, 24 bytes
System Slot Information
        Designation: PCIe Slot 2
        Type: x16 PCI Express 5 x16
        Current Usage: In Use
        Length: Long
        ID: 2
        Characteristics: None
        Bus Address: 0002:01:00.0
        Data Bus Width: 13
        Peer Devices: 0
        PCI Express Generation: 5
        Slot Physical Width: x16
        Height: Full height

Handle 0x0009, DMI type 9, 24 bytes
System Slot Information
        Designation: M.2 NVMe Drive Slot 1
        Type: x4 M.2 Socket 3
        Current Usage: In Use
        Length: Other
        Characteristics: None
        Bus Address: 0004:01:00.0
        Data Bus Width: 10
        Peer Devices: 0
        Slot Physical Width: x4
        Height: Other

Handle 0x000A, DMI type 9, 24 bytes
System Slot Information
        Designation: M.2 NVMe Drive Slot 2
        Type: x4 M.2 Socket 3
        Current Usage: In Use
        Length: Other
        Characteristics: None
        Bus Address: 0005:01:00.0
        Data Bus Width: 10
        Peer Devices: 0
        Slot Physical Width: x4
        Height: Other

Handle 0x000B, DMI type 9, 24 bytes
System Slot Information
        Designation: PCIe Slot 3
        Type: x16 PCI Express 5 x16
        Current Usage: In Use
        Length: Long
        ID: 5
        Characteristics: None
        Bus Address: 0006:01:00.0
        Data Bus Width: 13
        Peer Devices: 0
        PCI Express Generation: 5
        Slot Physical Width: x16
        Height: Full height

Handle 0x000C, DMI type 11, 5 bytes
OEM Strings

Handle 0x000D, DMI type 13, 22 bytes
BIOS Language Information
        Language Description Format: Abbreviated
        Installable Languages: 1
                enUS
        Currently Installed Language: enUS

Handle 0x000E, DMI type 16, 23 bytes
Physical Memory Array
        Location: System Board Or Motherboard
        Use: System Memory
        Error Correction Type: Single-bit ECC
        Maximum Capacity: 240 GB
        Error Information Handle: No Error
        Number Of Devices: 1

Handle 0x000F, DMI type 17, 92 bytes
Memory Device
        Array Handle: 0x000E
        Error Information Handle: 0x0000
        Total Width: 540 bits
        Data Width: 480 bits
        Size: 240 GB
        Form Factor: Die
        Set: None
        Locator: LP5x_0
        Bank Locator: LP5x_0
        Type: LPDDR5
        Type Detail: None
        Speed: 8532 MT/s
        Manufacturer: NVIDIA
        Serial Number: 9223381050307638979
        Asset Tag: Not Specified
        Part Number: Not Specified
        Rank: 1
        Configured Memory Speed: Unknown
        Minimum Voltage: 1.1 V
        Maximum Voltage: 1.1 V
        Configured Voltage: 1.1 V
        Memory Technology: DRAM
        Memory Operating Mode Capability: None
        Firmware Version: Not Specified
        Module Manufacturer ID: Bank 4, Hex 0x6B
        Module Product ID: Unknown
        Memory Subsystem Controller Manufacturer ID: Unknown
        Memory Subsystem Controller Product ID: Unknown
        Non-Volatile Size: None
        Volatile Size: 240 GB
        Cache Size: None
        Logical Size: None

Handle 0x0010, DMI type 19, 31 bytes
Memory Array Mapped Address
        Starting Address: 0x00080000000
        Ending Address: 0x03C800003FF
        Range Size: 240 GB
        Physical Array Handle: 0x000E
        Partition Width: 0

Handle 0x0011, DMI type 2, 17 bytes
Base Board Information
        Manufacturer: Not Specified
        Product Name: Not Specified
        Version: Not Specified
        Serial Number: Not Specified
        Asset Tag: Not Specified
        Features:
                Board requires at least one daughter board
                Board is replaceable
        Location In Chassis: Unknown
        Chassis Handle: 0xFFFE
        Type: System Management Module
        Contained Object Handles: 0

Handle 0x0012, DMI type 2, 17 bytes
Base Board Information
        Manufacturer: Not Specified
        Product Name: Not Specified
        Version: Not Specified
        Serial Number: Not Specified
        Asset Tag: Not Specified
        Features:
                Board requires at least one daughter board
                Board is replaceable
        Location In Chassis: Unknown
        Chassis Handle: 0xFFFE
        Type: Processor+Memory Module
        Contained Object Handles: 1
                0x000F

Handle 0x0013, DMI type 32, 11 bytes
System Boot Information
        Status: No errors detected

Handle 0x0014, DMI type 38, 18 bytes
IPMI Device Information
        Interface Type: SSIF (SMBus System Interface)
        Specification Version: 2.0
        I2C Slave Address: 0x08
        NV Storage Device Address: 0
        Base Address: 0x08 (SMBus)

Handle 0x0015, DMI type 41, 11 bytes
Onboard Device
        Reference Designation: Embedded Video Controller
        Type: Video
        Status: Enabled
        Type Instance: 1
        Bus Address: 0008:02:00.0

Handle 0x0016, DMI type 45, 24 bytes
Firmware Inventory Information
        Firmware Component Name: UEFI
        Firmware Version: buildbrain-gcid-36287995
        Firmware ID: UEFI
        Release Date: 2024-05-15T20:26:25+00:00
        Manufacturer: NVIDIA
        Lowest Supported Firmware Version: buildbrain-gcid-36287995
        Image Size: 64 MB
        Characteristics:
                Updatable: No
                Write-Protect: Yes
        State: Enabled
        Associated Components: 0

Handle 0x0017, DMI type 45, 24 bytes
Firmware Inventory Information
        Firmware Component Name: System ROM
        Firmware Version:         00020003
        Firmware ID: NVIDIA System Firmware
        Release Date: 20240516
        Manufacturer: NVIDIA
        Lowest Supported Firmware Version:         00020003
        Image Size: 64 MB
        Characteristics:
                Updatable: Yes
                Write-Protect: No
        State: Enabled
        Associated Components: 0

Handle 0x0018, DMI type 45, 26 bytes
Firmware Inventory Information
        Firmware Component Name: Full FW Image
        Firmware Version: 28.41.1000
        Firmware ID: Full FW Image
        Release Date: Not Specified
        Manufacturer: Not Specified
        Lowest Supported Firmware Version: 0x00000002
        Image Size: None
        Characteristics:
                Updatable: Yes
                Write-Protect: No
        State: Enabled
        Associated Components: 1
                0x0008

Handle 0x0019, DMI type 45, 24 bytes
Firmware Inventory Information
        Firmware Component Name: FW_FPGA_0
        Firmware Version: 0.96
        Firmware ID: FW_FPGA_0
        Release Date: Not Specified
        Manufacturer: Not Specified
        Lowest Supported Firmware Version: 0.96
        Image Size: Unknown
        Characteristics:
                Updatable: Yes
                Write-Protect: Yes
        State: Enabled
        Associated Components: 0

Handle 0xFEFF, DMI type 127, 4 bytes
End Of Table
```

## Linux cmdline

```
$ cat /proc/cmdline
BOOT_IMAGE=/boot/vmlinuz-6.8.0-45-generic root=UUID=5c1b9bd0-cddf-411f-8f05-a1ede3a45f78 ro audit=0 default_hugepagesz=2M hugepagesz=1G hugepages=32 hugepagesz=2M hugepages=32768 iommu.passthrough=1 isolcpus=1-71 nmi_watchdog=0 nohz_full=1-71 nosoftlockup processor.max_cstate=1 rcu_nocbs=1-71 cpufreq.off=1 cpuidle.off=1
```

## NVidia Grace Server Firmware Inventory

```
Host.           IPMI IP.      BMC.      BIOS. Cx-7 Firmware.  mlx5.
s36-t27-sut1.   10.30.50.36.  ?.        ?.    TBD.            24.04-0.7.0.0.
```