Companies you'll love to work for

Sr. Data Scientist - NLP/GenAI

Semma Therapeutics

Semma Therapeutics

Software Engineering, Data Science
Boston, MA, USA
Posted on Wednesday, June 26, 2024

Job Description

Are you a data scientist with experience in natural language processing, machine learning, and advanced analytics, and looking to apply your expertise in a collaborative and intellectually stimulating environment? Are you passionate about the future of healthcare?

Our enterprise-wide data science, data engineering and product team is looking for a data scientist with experience in natural language processing and a passion for addressing business needs through data analysis and solutions. You will work on a highly collaborative, centralized team of data scientists, engineers, and strategists to deliver analytical insights and data products that drive value and impact for our highest priority business needs. You’ll work side-by-side with internal partners across development, clinical, commercial, and general and administrative areas to develop creative NLP solutions that contribute meaningfully to our business and patients.

Key Duties and Responsibilities:

  • Collaborate with a centralized team of data scientists/engineers/strategists and cross-functional partners to conceptualize and deploy data science solutions for business problems using NLP (including but not limited to large language models), and to design and execute A/B tests to quantify the value of these solutions
  • Build and deliver compelling data visualizations and outputs to communicate findings to technical collaborators, non-technical audiences, and business leaders
  • Participate in the broader data science community to stay current with methodology, software, and data development and availability
  • Bring an entrepreneurial and ethical mindset, openness, transparency, and collegiality to your work

Minimum Qualifications:

  • Bachelor’s, Master’s, or PhD degree in a computational or quantitative discipline, including but not limited to data science, statistics, computer science, computational linguistics, biomedical informatics, neuroscience, physics, epidemiology, health economics
  • 2+ years of experience developing and/or applying ML/NLP solutions in an industry or academic context
  • Expertise in programming languages (e.g. Python, R, SQL, JavaScript), version control, and other data science related tools (e.g. Shiny, D3, AWS, dbt)
  • Expertise in working with natural language data and building text-based products, using both classic and state of the art NLP techniques (e.g. text mining, word embeddings, transformer-based models)
  • Experience with LLM prompt engineering and familiarity with LLM-based workflows/architectures such as retrieval-augmented generation
  • Experience with statistical/analytical methodologies and algorithms (e.g. classification, regression, clustering, feature selection/engineering, deep learning, time-series analysis, network analysis, hypothesis testing)
  • Exceptional communication skills and ability to present findings to non-technical audiences
  • Experience in effective data visualization approaches and a keen eye for detail in the visual communication of findings
  • Demonstrated history of adherence to highest standards of data ethics

Preferred Qualifications:

  • 3+ years of industry NLP data science experience
  • Familiarity with data product UX/UI design and testing
  • Familiarity with LLMOps, including deployment, monitoring, and maintenance of data solutions
  • Prior experience with using advanced analytics and/or developing advanced data visualizations and dashboards (e.g. R Shiny) in business settings
  • Prior exposure to clinical data, real-world data (EMR, claims), manufacturing or supply chain data, or life sciences-related research data
  • Knowledge of the biopharma or healthcare industry

Flex Designation:

Hybrid-Eligible Or On-Site Eligible

Flex Eligibility Status:

In this Hybrid-Eligible role, you can choose to be designated as:
1. Hybrid: work remotely up to two days per week; or select
2. On-Site: work five days per week on-site with ad hoc flexibility.

Note: The Flex status for this position is subject to Vertex’s Policy on Flex @ Vertex Program and may be changed at any time.

Company Information

Vertex is a global biotechnology company that invests in scientific innovation.

Vertex is committed to equal employment opportunity and non-discrimination for all employees and qualified applicants without regard to a person's race, color, sex, gender identity or expression, age, religion, national origin, ancestry, ethnicity, disability, veteran status, genetic information, sexual orientation, marital status, or any characteristic protected under applicable law. Vertex is an E-Verify Employer in the United States. Vertex will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law.

Any applicant requiring an accommodation in connection with the hiring process and/or to perform the essential functions of the position for which the applicant has applied should make a request to the recruiter or hiring manager, or contact Talent Acquisition at