Research Scientist (Multiple Positions)
![]() | |
![]() United States, California, San Francisco | |
![]() | |
Research Scientist (Multiple Positions), Databricks, Inc., San Francisco, CA. Participate in conducting foundational research into solutions aimed at analyzing data using natural language through the design of large-scale distributed AI/ML systems, optimize distributed GPU model serving, or develop novel modeling methodologies that scale to production use cases. Assist in the development and deployment of state-of-the-art AI models and systems that impact the capabilities and performance of Databricks' products and services. Collaborate in architecting and implementing robust, scalable ML infrastructure to support seamless integration of AI/ML models into production environments. Develop novel data collection, fine-tuning, and pre-training strategies that achieve optimal performance on specific tasks and domains. Design and implement automated ML pipelines for data pre-processing, feature engineering, model training, hyperparameter tuning, and model evaluation, enabling rapid experimentation and iteration. Telecommuting permitted. (DBxCA052)
40 hrs/week, Mon-Fri, 8:30 a.m. - 5:30 p.m. Salary range: $282,800 - $292,800/yr.
MINIMUM REQUIREMENTS:
Must have a Master's degree (or foreign equivalent) in Computer Science, Engineering, Data Science, or a related field, plus 24 months of experience in machine learning or AI.
Of the required experience, must have 24 months of research experience in all of the following (which may be gained concurrently):
· Machine learning engineering experience or ML research; · Developing AI/ML systems at scale in production or in high-impact research environments; · Working with language modeling technologies, including developing generative and embedding techniques, modern model architectures, fine tuning/pre-training datasets, and evaluation benchmarks; · Software engineering principles around testing, code reviews, and deployment; and · Deploying and scaling language models in production
Must also have contributed to at least one (1) open-source AI/ML project, and have at least one (1) peer-reviewed publication in a leading professional journal.
Up to 10% travel (domestic) required.
To apply, please send resumes to USapplications@databricks.com and reference job code (DBxCA052).
|