Datadog SME Job at Scicom Infrastructure Services, Atlanta, GA

  • Scicom Infrastructure Services
  • Atlanta, GA

Job Description

Overview :

We are seeking a highly skilled and experienced Datadog Subject Matter Expert (SME) to join our team. In this role, you will leverage your in-depth knowledge of Datadog’s monitoring, observability, and cloud infrastructure capabilities to guide the implementation, optimization, and maintenance of Datadog solutions across our systems. The ideal candidate will have extensive experience with Datadog’s features, including APM, log management, infrastructure monitoring, and cloud security.

This position will require close collaboration with cross-functional teams, including DevOps, IT, engineering, and security, to ensure Datadog is being used effectively to monitor system performance, diagnose issues, and enhance overall operational efficiency.

Key Responsibilities :

  • Datadog Implementation and Integration :
    Lead the deployment and integration of Datadog solutions across various cloud environments (AWS, Azure, GCP) and on-premises systems. Ensure Datadog agents, monitors, dashboards, and integrations are configured according to best practices.
  • Monitoring and Troubleshooting :
    Provide subject matter expertise in configuring Datadog’s monitoring tools for infrastructure, applications, logs, and security. Troubleshoot complex issues related to system performance, application logs, and infrastructure alerts using Datadog.
  • Optimization :
    Continuously monitor and optimize Datadog usage to ensure efficient use of resources, minimize costs, and maximize the value derived from the platform. Recommend and implement improvements to existing Datadog setups.
  • Custom Dashboards & Alerts :
    Design, create, and maintain custom Datadog dashboards and alerting systems tailored to the needs of different teams and stakeholders. Ensure alert thresholds, notification channels, and escalation paths are properly configured.
  • Collaboration & Knowledge Sharing :
    Act as the primary point of contact for Datadog-related inquiries. Work closely with DevOps, IT, and engineering teams to ensure Datadog is aligned with business goals. Provide training and mentorship to junior team members.
  • Data Analysis & Reporting :
    Analyze data collected from Datadog to provide actionable insights into application performance, infrastructure health, and overall system reliability. Generate regular reports and provide recommendations for process improvements.
  • Best Practices & Documentation :
    Establish and enforce Datadog best practices for monitoring, alerting, and dashboard design. Create and maintain comprehensive documentation for Datadog configurations and integrations.
  • Stay Current with Datadog Updates :
    Keep up to date with the latest Datadog features, releases, and industry trends. Provide recommendations for adopting new tools or features that can enhance system observability.

Qualifications :

  • Technical Skills :
    • Deep knowledge of Datadog, including its full suite of monitoring and observability tools (Infrastructure Monitoring, APM, Log Management, Synthetics, etc.).
    • Experience with cloud platforms (AWS, Azure, GCP) and containerization and orchestration tools (Docker, Kubernetes).
    • Familiarity with programming/scripting languages (e.g., Python, Bash, Go) for automating and customizing Datadog tasks.
    • Strong understanding of network protocols, databases, and application performance management (APM).
    • Experience with CI/CD pipelines, DevOps tools, and integration of Datadog in modern software delivery workflows.
  • Experience :
    • Minimum of [X] years of experience in a monitoring, DevOps, or site reliability engineering (SRE) role, with at least [Y] years of hands-on experience specifically with Datadog.
    • Proven experience deploying and configuring Datadog in complex, distributed environments.
    • Experience in troubleshooting performance bottlenecks, diagnosing application errors, and providing root cause analysis.
  • Soft Skills :
    • Strong problem-solving and analytical skills with the ability to quickly understand complex systems.
    • Excellent communication skills, with the ability to translate technical concepts for non-technical stakeholders.
    • Ability to work independently and as part of a collaborative team.
    • Strong organizational skills and attention to detail.

Preferred :

  • Datadog certifications (e.g., Datadog Certified Expert).
  • Salesforce experience.
  • Experience with log aggregation, APM, and synthetic monitoring.
  • Familiarity with cloud security monitoring using Datadog.
  • Experience with infrastructure-as-code (IaC) tools like Terraform or CloudFormation.

This is a remote position.
