Senior Systems Reliability Engineer
Company: DRW Holdings, LLC.
Location: Chicago
Posted on: November 13, 2024
Job Description:
DRW is a diversified trading firm with over three decades of
experience bringing sophisticated technology and exceptional people
together to operate in markets around the world. We value autonomy
and the ability to quickly pivot to capture opportunities, so we
operate using our own capital and trading at our own risk.
Headquartered in Chicago with offices throughout the U.S., Canada,
Europe, and Asia, we trade a variety of asset classes including
Fixed Income, ETFs, Equities, FX, Commodities and Energy across all
major global markets. We have also leveraged our expertise and
technology to expand into three non-traditional strategies: real
estate, venture capital and cryptoassets. We operate with respect,
curiosity and open minds. The people who thrive here share our
belief that it's not just what we do that matters; it's how we do
it. DRW is a place of high expectations, integrity, innovation and
a willingness to challenge consensus.

We are seeking a Systems
Reliability Engineer to join our Fixed Income Commodities and
Currency Options (FICCO) and Cumberland team in either Chicago or
London. In this role, you will be responsible for designing and
supporting highly available systems within a technologically
diverse stack used for global research and trading of FICCO and
Cryptoassets. Leveraging tools such as AWS, Docker, Kubernetes,
CI/CD, Python, Prometheus and Grafana, you will develop a
repeatable and supportable tech stack to meet the demanding needs
of our business.
Core Responsibilities:
- Collaborate with our FICCO and Cumberland technology and
trading teams regarding their CI/CD processes.
- Collaborate with development teams to troubleshoot software
build issues and optimize packaging processes.
- Automate deployment processes to improve efficiency and reduce
manual intervention.
- Implement and manage infrastructure as code tools such as
Terraform and Ansible.
- Maintain, design, and troubleshoot our observability
stack.
- Drive initiatives to modernize environments by developing and
optimizing processes using appropriate cloud and container tools,
such as AWS and Kubernetes.
- Consistently challenge the norm and advocate for change.
Skills and Qualifications
- Proven experience as a DevOps Engineer, Site Reliability
Engineer, or similar software engineering role
- Strong expertise with observability tools such as Prometheus,
Alertmanager, Grafana, Sentry, and OpenTelemetry
- Proficiency with Python, Java, and C++ software builds and
packaging
- Hands-on experience with CI/CD tools like TeamCity, Concourse,
Argo Workflows, and/or GitHub Actions
- Solid understanding of Infrastructure as Code (IaC) tools such
as Terraform, Terragrunt, and Ansible
- Skills in Python for troubleshooting and maintaining
environment dependencies
- Proficient with Docker for image creation, networking, and
execution
- Experienced with Kubernetes, including deployment and
management of applications
- Knowledge of ArgoCD, Helm, and Kustomize for Kubernetes
application management
- Fundamental understanding of Git and familiarity with Git
repository tools such as GitHub and GitLab
- Linux experience with Debian and Red Hat-based systems
- Excellent organizational skills, with the ability to
effectively plan and prioritize tasks
- Strong collaborative team spirit and communication skills
Preferred Qualifications
- Bachelor's degree in Computer Science, Engineering, or a
relevant field
- Experience using Conda, including environment management and
conda-build for creating conda packages
- Experience deploying and maintaining CI/CD pipelines in a
large-scale production environment
- Hands-on experience with cloud platforms and services, such as
AWS, GCP, or Azure
- Experience supporting the infrastructure and systems that
facilitate electronic trading functions or other high-performance
computing environments
- Experience in consolidating diverse and redundant approaches to
common problems

For more information about DRW's processing
activities and our use of job applicants' data, please view our
Privacy Notice at https://drw.com/privacy-notice.