Captions is the leading AI video company—our mission is to empower anyone, anywhere to tell their stories through video. Over 10 million creators and businesses have used Captions to simplify video creation with truly novel and groundbreaking AI capabilities.
We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. As an early member of our team, you’ll have an opportunity to have an outsized impact on our products and our company's culture.
Our Technology
Mirage Announcement: our proprietary omni-modal foundation model
Seeing Voices (technical paper): generating A-roll video from audio with Mirage
Mirage Studio for generating expressive videos at scale
“Captions: For Talking Videos” available in the iOS App Store
Press Coverage
Lenny’s Podcast: Interview with Gaurav Misra (CEO)
Latest Fundraise: Series C Announcement
The Information: 50 Most Promising Startups
Fast Company: Next Big Things in Tech
Business Insider: 34 most promising AI startups
TIME: The Best Inventions of 2024
Our Investors
We’re very fortunate to have some of the best investors and entrepreneurs backing us, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SV Angel, 20VC, Ludlow Ventures, Chapter One, and more.
** Please note that all of our roles require you to be in person at our NYC HQ (located in Union Square).
We do not work with third-party recruiting agencies; please do not contact us. **
About the role:
We are seeking a talented Software Engineer with demonstrated experience in video processing to join our software engineering team. In this role, you will design and optimize video encoding pipelines, ensuring high-quality video processing that powers our video creation tools. Your work will directly impact the performance and scalability of our video platform, enabling seamless video creation experiences for users all over the world. 
Key Responsibilities:
Develop and optimize video encoding algorithms and workflows.
Implement efficient video processing pipelines for encoding, decoding, and transcoding.
Collaborate with cross-functional teams to improve video quality and performance.
Stay updated on industry trends and emerging video technologies.
Requirements:
Demonstrated expertise in video encoding technologies (e.g., H.264, HEVC, VP9, AV1).
Proficiency with video processing frameworks and libraries (e.g., FFmpeg, GStreamer).
Solid programming skills in Python, C++, or Rust (or related technologies).
Experience optimizing video encoding for performance and scalability.
Benefits:
Comprehensive medical, dental, and vision plans
401(k) with employer match
Commuter benefits
Catered lunch multiple days per week
Dinner stipend every night if you're working late and want a bite!
DoorDash DashPass subscription
Health & Wellness Perks (Talkspace, Kindbody, One Medical subscription, HealthAdvocate, Teladoc)
Multiple team offsites per year with team events every month
Generous PTO policy
Captions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.
Please note that benefits apply to full-time employees only.