Amazon CloudWatch gives customers actionable visibility into the health of their applications and services. Today we run one of the largest time-series data stores on the planet and teams in CloudWatch solve problems of massive metric ingestion, distributed systems/cloud computing, data visualization, log processing and analytics, anomaly detection, and predictive analytics. Our services are the “eyes and ears” for customers everywhere, both inside Amazon along with AWS customers across the world.
We are looking for systems and devops engineers that are passionate about all things operations and know how to drive automation and excellence into every aspects of building, deploying, running, and scaling our gigantic cloud services. CloudWatch has a fast-paced environment where we “Work Hard, Have Fun, Make History.” On a typical day, our systems engineers might deep dive to root cause a customer issue, investigate why a metric is trending the wrong way, consult with the top engineers at Amazon, or discuss radical new approaches to automate operational issues. You'll be surrounded by people who are wickedly smart, passionate about monitoring, and believe that we are only scratching the surface of what CloudWatch can really do to help application and service owners everywhere.
Bachelors Degree in Computer Science or a related field, or relevant work experience
· Working knowledge of the Linux operating system and basic system tools
· 5+ years building and running systems for high availability Internet-facing services
· 5+ years of development of systems management and administration automation in Perl, Python, Ruby, or Java
· Excellent troubleshooting skills
· Excellent communication skills and the ability to work well in a team
Advanced degree in Computer Science or an Engineering discipline
Prior experience in a monitoring domain including infrastructure monitoring, application performance management (APM), network performance monitoring (NPM), etc.
Experience with massively scaled distributed systems
Experience with Linux performance testing, profiling, and tuning
Working knowledge of SQL and database administration basics
Knowledge of TCP/IP networking, architecture, and core technologies (such as DNS, DHCP, HTTP, Routing, VPN)