mirror of
https://github.com/netbootxyz/netboot.xyz.git
synced 2026-04-26 07:35:57 +03:00
[GH-ISSUE #1255] netboot.xyz not loading in Equinix Metal #367
Originally created by @enkelprifti98 on GitHub (Jun 7, 2023).
Original GitHub issue: https://github.com/netbootxyz/netboot.xyz/issues/1255
Originally assigned to: @antonym on GitHub.
Describe the bug
Deploying Equinix Metal instances with custom ipxe and https://boot.netboot.xyz does not work. It gets stuck chainloading the netboot.xyz image.
To Reproduce
Steps to reproduce the behavior:
https://boot.netboot.xyz
Expected behavior
The netboot.xyz image should load.
@vielmetti commented on GitHub (Jun 7, 2023):
Happy to provide support for anyone interested in looking into this.
@antonym commented on GitHub (Jun 7, 2023):
Any particular machine type this is occurring on or specific architecture? Have you tried loading older releases instead of rolling? Was it working before?
@enkelprifti98 commented on GitHub (Jun 7, 2023):
@antonym Seems to be mainly the m3.small.x86 instance type affected. Others like the c3.small seem to work fine.
@antonym commented on GitHub (Jun 13, 2023):
If the implementation is working on other instance types of the same arch, I don't see why it wouldn't work unless there is something new with the hardware. If it was working previously, it could be an upstream iPXE change that rolled in, and then we could try booting older releases to see if anything works on that type.
@jackson-tim commented on GitHub (Jun 13, 2023):
This also seems to be happening on the n3.xlarge instances. Maybe it's a common NIC type? E810s maybe?
@vielmetti commented on GitHub (Jun 29, 2023):
@antonym I have added some credits to your Equinix account, to assist in any testing. Unclear to me at this point what the scope of this problem is right now.
@enkelprifti98 commented on GitHub (Jun 29, 2023):
Hi @antonym @vielmetti! I did some more testing and I believe this is an issue with nodes that have the Intel E810 NIC.
I deployed 5x m3.small.x86 instances. The ones with Mellanox NICs get to the netboot.xyz menu just fine. The ones with the Intel E810 NIC get stuck on the following screen seen through the Out-of-Band serial console.
@antonym commented on GitHub (Jun 29, 2023):
I saw this link https://github.com/ipxe/ipxe/issues/498 but I'm not sure if there was any resolution. Any idea what firmware version the NIC is on, and whether a newer revision is available from Intel?
@antonym commented on GitHub (Jun 30, 2023):
Also found this thread:
https://github.com/ipxe/ipxe/issues/937
I'll try to create some SNP EFI images this weekend and we can see if they provide a better result.
@antonym commented on GitHub (Jul 1, 2023):
Try this url for custom_url to see if you have any issues:
I've added builds of the SNP EFI kernels and made the SNP kernels the default on everything, since that should avoid the built-in iPXE drivers, which are not always kept up to date as newer hardware is deployed.
I've tested on one machine that was having the issue and it seemed to start loading after that. I'm not sure of the wider impact on other machine types yet, but it should be a more reliable path to go.
github.com/netbootxyz/netboot.xyz@854df13acc
github.com/netbootxyz/netboot.xyz@da3e1d9243
This is mainly in development now and not pushed to production yet, will give it a bit of time to bake so others can test with.
@antonym commented on GitHub (Jul 6, 2023):
Closing this out as I think this should be resolved and work better going forward, please ping me if you notice any oddness. Thanks to @vielmetti for the testing credits!
@vielmetti commented on GitHub (Jul 6, 2023):
Thanks @antonym for checking this out - the credits should be ample for any reasonable test setup that is necessary now or in the future.
@enkelprifti98 commented on GitHub (Jul 11, 2023):
Tested an E810 node with https://boot.netboot.xyz and it works fine now. I guess the change is live in production at this point. Thanks for addressing the issue!