Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New Helix Queue for ARM64 Cedar Crest hardware #3343

Closed
1 of 3 tasks
JpratherMS opened this issue Jul 10, 2024 · 15 comments
Closed
1 of 3 tasks

New Helix Queue for ARM64 Cedar Crest hardware #3343

JpratherMS opened this issue Jul 10, 2024 · 15 comments
Assignees
Labels
Ops - Service Maintenance Used to track issues related to maintaining the services .NET Eng Supports Ops - Spike Work items to be included in our Ops Spike

Comments

@JpratherMS
Copy link

JpratherMS commented Jul 10, 2024

We have a requirement from Julie Lee to get this new hardware into Helix. I believe we will need a new queue for this and some testing to make sure the hardware will work. I think its likely we will be using a specific version of Server 2025.

Release Note Category

  • Feature changes/additions
  • Bug fixes
  • Internal Infrastructure Improvements

Release Note Description

Added new test queues for ARM64 Cedar Crest hardware

@riarenas riarenas added the Ops - Service Maintenance Used to track issues related to maintaining the services .NET Eng Supports label Jul 10, 2024
@ilyas1974 ilyas1974 added the Ops - P1 Operations task, priority 1 (highest priority) label Jul 24, 2024
@ilyas1974
Copy link
Contributor

After talking with @JpratherMS earlier today, it appears that we need 4 new queues to support this hardware

windows.2025
ubuntu.2204
windows.2025.perf
ubuntu.2204.perf

@ilyas1974 ilyas1974 added Ops - Spike Work items to be included in our Ops Spike and removed Ops - P1 Operations task, priority 1 (highest priority) labels Jul 25, 2024
@ilyas1974
Copy link
Contributor

All 4 queues need to be associated with the internal project.

@riarenas riarenas self-assigned this Aug 20, 2024
@riarenas
Copy link
Member

@ilyas1974 what are the expected matrix of truth properties for these queues?

@ilyas1974
Copy link
Contributor

I believe the OS type needs to exist in os-definitions.json whenever we add a new OS to helix, but I'm not 100% sure.

@riarenas
Copy link
Member

I believe that's the case as well. What would be the values for these systems?

@riarenas
Copy link
Member

riarenas commented Aug 21, 2024

Are there any existing queues that these new queues should be based on? regarding artifacts needed, etc.

@ilyas1974
Copy link
Contributor

I would base these queues on the existing windows server 2022 queues as well as existing ubuntu 22.04 queues. As these are on-premises machines, the ubuntu.2204.amd64.viper.perf and windows.11.amd64.viper.perf may be the best place to start.

@ilyas1974
Copy link
Contributor

Full names of the queues should be

windows.2025.arm64.cedarcrest
ubuntu.2204.arm64.cedarcrest
windows.2025.arm64.cedarcrest.perf
ubuntu.2204.arm64.cedarcrest.perf

@riarenas
Copy link
Member

The above queues should now be available in our staging environment if we want to validate some of the hardware there. The queues will be available in the production environment as part of our next deployment

@ilyas1974
Copy link
Contributor

The linux install of helix was successful.

The Windows install did produce an error
Image

cc: @v-parose

@riarenas
Copy link
Member

Going to start taking a look today.

@ilyas1974 or @v-parose do either of you have the full set of instructions that are being followed? The requested blob seems to be in the storage account, so this 404 might be some authentication problems in disguise. Having access to the full output in a searchable (non image) form would also help find what might be going on.

@riarenas
Copy link
Member

I have the information now and I'm taking a look at what might be going on here.

@riarenas
Copy link
Member

I think I tracked this down to a bad sas that isn't being tracked by secret manager. I attempted to fix it manually to see if that helps. While we try to set up a machine again, I'll double check that this sas is properly managed going forward.

@riarenas
Copy link
Member

@v-parose has confirmed that after the sas fixup he was able to provision a windows machine for this queue setup. #4163 covers the follow up to manage this sas properly.

@riarenas
Copy link
Member

as we have confirmation that these systems can be set up in Helix, I'm closing this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Ops - Service Maintenance Used to track issues related to maintaining the services .NET Eng Supports Ops - Spike Work items to be included in our Ops Spike
Projects
None yet
Development

No branches or pull requests

3 participants