Senior Software Engineer - Data
BenchSci
You Will:
- Scale data pipelines to allow our data to go from research to platform as fast as possible
- Collaborate with ML and Data engineers to solve complex document mining challenges, enabling us to capture and model additional scientific experiments
- Develop data access mechanisms for downstream applications consumption
- Expand the types of biological reagents and experiments that we can successfully extract from various types of media
- Manage sources that contain both semi-structured and unstructured data
- Apply state-of-the-art data processing engines to rapidly process our data sets
- Employ best practices for a broad platform of technologies within a cloud-based environment
- Provide troubleshooting analysis and resolution in a timely manner
- Work on projects involving some of the largest pharmaceutical companies in the world
- Solve difficult problems and bring new perspectives to the team
You Have:
- Degree in Software Engineering, Computer Science, or a similar area
- 4+ years of experience working as a professional software engineer, data in industry
- Expertise with Python and SQL
- Experience with Retrieval Augmented Generation (RAG)
- Experience with cloud development (we use GCP and Terraform) including reference architectures and developing specialized stacks on cloud services
- A strong background in data structures and large-scale data manipulation
- Strong cross-team communication and collaboration skills
