Job Opportunity: Senior DevOps Engineer

About Civitai

Civitai is at the cutting-edge of AI and digital content creation, democratizing access to open-source AI resources on a global scale. We're looking for a seasoned Senior DevOps Engineer who can ensure the seamless operation of our IT infrastructure and the smooth deployment of our software solutions.

The Opportunity

As a Senior DevOps Engineer, you will play a foundational role in streamlining our operations and automating our processes. Your expertise will be key in maintaining our Kubernetes clusters, working with various cloud platforms such as AWS, Azure, Cloudflare, Digital Ocean, and Vultr. You will help us improve the stability and performance of our main Civitai web experience, expand our offerings, and enhance the productivity of our development teams.

Responsibilities

  • Infrastructure Management: Develop, deploy, and maintain various components of our infrastructure, including PostgreSQL, Kubernetes, Docker, RabbitMQ, Nginx, Clickhouse, and more.
  • CI/CD Pipelines: Improve our development efficiency and reliability by enhancing our CI/CD pipelines using GitHub Actions.
  • Collaboration: Collaborate with software engineers to ensure that our services are designed to be cloud-native and scalable.
  • System Architecture: Participate in architectural decisions and take charge of implementing scalable and resilient system architectures.
  • Monitoring: Proactively monitor system performance and identify potential issues before they become problems.
  • Best Practices: Develop and enforce best practices in release and deployment processes.
  • On-Call and Disaster Recovery: Participate in on-call rotation and disaster recovery drills.
  • Documentation: Keep our tech stack documentation up-to-date and contribute to internal knowledge sharing.

Qualifications

  • Bachelor's degree in Computer Science or a related field, or equivalent work experience.
  • Minimum of 5 years of experience in DevOps or related roles, with a focus on cloud infrastructure and operations.
  • Proficient knowledge of PostgreSQL, Docker, Kubernetes, and Nginx.
  • Experience setting up and managing RabbitMQ and Clickhouse.
  • Proficient experience with CI/CD pipelines, specifically with GitHub Actions.
  • Strong understanding of network protocols, IP networking, and system management.
  • Experience with scripting languages like Python, Bash, or similar.
  • Excellent communication skills, both verbal and written.
  • Experience in an AI or Machine Learning environment is a plus.
  • Familiarity with security best practices for cloud-based services is beneficial.
  • Any relevant certifications such as Certified Kubernetes Administrator (CKA), Certified Kubernetes Application Developer (CKAD), or Certified Kubernetes Security Specialist (CKS) are a plus.

Your Journey at Civitai

  • First 30 days: Familiarize yourself with our infrastructure, address immediate flaws, build improvement plans, and implement necessary monitoring tools.
  • First 60 days: Complete the plans from your initial analysis. Showcase your ability to improve and optimize our infrastructure.
  • First 90 days: Enhance the development experience for our team and open-source contributors, and provide necessary guides to get started.

What You Can Expect From Civitai

  • $166K - $198K annually, based on experience.
  • Fully remote position.

Join Civitai and be a part of our mission to democratize AI resources for all. We're excited to receive your application!

Apply HERE