Software Engineer focusing on data collection and ingestion pipeline for AI model training at Speechify. Collaborating with Scientists to enhance data quality and cost-efficiency.
Responsibilities
Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products.
Requirements
BS/MS/PhD in Computer Science or a related field.
5+ years of industry experience in software development.
Proficiency with bash/Python scripting in Linux environments
Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
Experience with web crawlers, large-scale data processing workflows is a plus
Ability to handle multiple tasks and adapt to changing priorities.
Strong communication skills, both written and verbal.
Benefits
A fast-growing environment where you can help shape the company and product.
An entrepreneurial-minded team that supports risk, intuition, and hustle.
A hands-off management approach so you can focus and do your best work.
An opportunity to make a big impact in a transformative industry.
Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
Opportunity to work on a life-changing product that millions of people use.
Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.
Staff Software Engineer specializing in data infrastructure for Instacart's data governance and compute systems. Collaborating with engineering teams to enhance the platform's reliability and performance.
Principal Engineer designing mixed - signal IPs for Microchip Technology. Collaborating with SoC architects and managing IP intake processes for advanced analog solutions.
Principal Software Architecture Director overseeing software architecture and technology strategy at SGI. Providing guidance and mentorship while aligning with business goals in the insurance sector.
Senior Engineer leading design and implementation of protective relaying systems for BWRX - 300 Nuclear Reactor. Engaging in grid interface projects and customer technical assessments.
Overseeing SAP AMS operations and leading SAP support teams remotely from Canada. Ensuring adherence to SLAs and managing vendor relationships for outsourced SAP support.
Software Engineer (No - Code) at All Gen Tech developing applications by collaborating with teams. A role that emphasizes problem solving and adaption to new technologies in a remote environment.
Technical Lead providing hands - on leadership for Canadian payment systems at Servus Credit Union. Driving integrations, technical oversight, and modernization of payment services in a cooperative environment.
Software Engineer building and expanding internal and external platforms for SecondMuse's mission - driven work. Focusing on full - stack development, systems integration, and practical AI solutions.