Software Reliability Engineer
Milk Moovement
This job is no longer accepting applications
See open jobs at Milk Moovement.See open jobs similar to "Software Reliability Engineer" VMG Partners.Other Engineering
Toronto, ON, Canada
Posted on Feb 5, 2025
ABOUT THE COMPANY
Milk Moovement is building a world-class team focused on getting the right milk to the right place at the right time.
Our growing herd of employees is driven to provide our clients with the data they need to make critical decisions that impact their operations and ultimately your favourite dairy products.
Who is Milk Moovement you might ask? We are a young VC-backed company with humble roots and massive ambitions to disrupt the dairy supply chain. We think differently, act nimbly, and always leave things better than we found them.
We're expanding our team to further our mission. Find us out on Twitter, Instagram, LinkedIn (@milkmoovement), and our home page to learn more or hit “apply” below!
THE ROLE
We’re hiring an SRE (Software Reliability Engineer) to join our team! At Milk Moovement, we service several cooperatives in the dairy industry who rely on our platform for analytics, invoicing, and daily usage by drivers and producers via our mobile applications. Your role will be to proactively monitor, detect, and resolve platform issues to ensure smooth and efficient operations. You will implement monitoring and alerting solutions, investigate performance anomalies, and help refine our incident response process. This role is critical in ensuring high system availability and performance while collaborating with Cloud Engineering, feature engineering, and product teams.
WHAT YOU’LL BE DOING
• Implement and maintain monitoring solutions using Datadog, focusing on proactive detection and resolution of platform issues.
• Develop alerting mechanisms that trigger based on symptoms rather than just outages, ensuring early detection of problems.
• Analyze system metrics, logs, and performance data to identify trends and potential reliability concerns.
• Lead incident response efforts, including triaging, troubleshooting, and post-mortem analysis for continuous improvement.
• Manage and optimize logging and monitoring infrastructure to ensure observability across all services.
• Work closely with development teams to ensure features are deployed with minimal impact on platform reliability.
• Participate in on-call rotations and incident management workflows, ensuring rapid issue response and resolution.
• Assist in cloud engineering tasks where necessary, particularly in reliability-focused automation and infrastructure improvements.
WHAT WE ARE LOOKING FOR
Milk Moovement seeks to have a diverse, inclusive, team-oriented, and curiosity-driven herd. Our technical team lives to find unique solutions to the challenges inherent to digital supply chains, and we expect you will be excited to do so as well. You must have at least 3 years prior SRE or DevOps experience, with a focus on the reliability side. Experience working in the dairy industry is not required! We will teach you all there is to know about the industry beginning with our Dairy 101 course. It is definitely more complicated than you think and that is why we do what we do!
REQUIRED
• Strong experience with log aggregation and monitoring solutions. (Datadog, Splunk, ELK)
• Experience working with monitoring cloud deployed applications. (AWS, GCP, Azure)
• Familiarity with configuring incident management platforms. (Squadcast, PagerDuty)
• Experience using IaC for deployment and management. (Terraform, CloudFormation, CDK)
• Proficiency in JavaScript or Python for automation and debugging.
• Extensive experience in troubleshooting & triaging performance issues and incidents.
PREFERRED
• Datadog certification or extensive experience configuring and tuning monitoring solutions.
• Related AWS certifications or ample experience administering AWS environments.
• Proficiency building internal tooling and APIs leveraging serverless infrastructure (Lambda)
• Experience working with container-based services. (Docker, ECS, Kubernetes)Working knowledge of both SQL and NoSQL databases, including troubleshooting and performance tuning. (MongoDB, PostgreSQL, DynamoDB)
• Familiarity with CI/CD processes and automation frameworks.
WHAT WE OFFER
🐮 Competitive salaries - we’re constantly reevaluating market trends to ensure we meet or exceed industry standards.
🐮 Equity - Stock option plan on a standard 4 year vesting schedule with a 1 year cliff.
🐮 Unlimited paid vacation and flex time - unlimited vacation can be vague and difficult to track; we strongly encourage everyone to take at least 2 weeks off per year plus public holidays. The rest is up to you.
🐮 Health (mental & physical), dental, & HSA coverage across North America.
🐮 Remote work environment - work from home or from one of our hubs in Halifax and St. John’s.
🐮 Flexible hours - night owl or early riser? No problem.
🐮 Tools - need the latest and great software to perform more efficiently? Ask and you shall receive.
🐮 Quarterly culture events - trivia, robot building, hackathons, etc. We like to keep it fresh and exciting.
ABOUT OUR CULTURE
🥛 We’ll drop everything to ensure our customers feel supported.
🥛 Transparency is ingrained in everything we do.
🥛 Respect is paramount.
🥛 We win and lose as a herd - lessons learned are equally as important as the wins.
🥛 We’re all in this together - our company wide thirst for knowledge is unquenchable.
🥛 Want to learn a bit more about what makes us moo-nique? Check out our About Us page for company mission, purpose, and values.
🥛 Did we mention we love puns?!
HOW TO APPLY
To apply, please submit your resume through our Careers page. Don't forget to complete our Get To Know The Candidate form; we love hearing what your favourite dairy products are!
We always conduct remote interviews to ensure accessibility. This role offers flexibility based on your location and work preferences, and we'll collaborate closely with you because we recognize that each individual has unique circumstances.
Don't meet every single requirement? Studies have shown that women and individuals from diverse backgrounds may hesitate to apply for positions unless they meet nearly every qualification. At Milk Moovement, we are deeply committed to enhancing our approach to creating a diverse, inclusive, and value driven workplace. If you’re excited about this role but your past experiences don’t align perfectly with our job description, we encourage you to apply anyway. You may well be the right candidate for this role or others!
Milk Moovement is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, disability, age, or other legally protected status. Milk Moovement is committed to providing reasonable accommodations for individuals with disabilities during the application and interview process. If you require an accommodation, please notify your Recruiter.
This job is no longer accepting applications
See open jobs at Milk Moovement.See open jobs similar to "Software Reliability Engineer" VMG Partners.