Senior Research Engineer, Olmo
Senior Research Engineer, OlmoSeattle, WAWho We AreWe are a non-profit AI institute, focused on developing foundational AI research and innovation to deliver real-world impact through large-scale open models, data, and artifacts (e.g., Olmo, Tulu, Molmo). We unite the best and brightest scientific and engineering minds to explore the potential of truly open AI. Through our efforts, including the pioneering Olmo releases, we endeavor to empower academics, researchers, and AI developers more broadly to advance language models and generative AI models. Through close collaboration, we rapidly identify, define, and act on the most exciting and promising new ideas in AI.Our team engages in a broad range of AI research, including pre-training and post-training language models, curating data to enhance AI across different modalities, and developing novel methodologies to push the field forward. We study and evaluate AI models both theoretically and empirically, aiming to advance their capabilities. Additionally, we create impactful real-world applications, such as in scientific synthesis. Our goal is to develop state-of-the-art models that excel in scientific discovery, reasoning, and factual recall.Who You AreYou are a talented, hands-on engineer who thrives in a fast-paced environment, is self-directed, a team player, and knows how to get things done. You have a deep knowledge of Python, infrastructure, and a strong understanding of modern deep learning, natural language processing, language models, and the inner workings of the transformer architecture. You can translate high-level goals into concrete research and implementation steps, set an approach, follow through, and present results. When it's time to explain your ideas, you bring clarity to complex technical issues. You use these skills to create real-world benefits for researchers and other practitioners, and you are excited to help advance our effort to create the best-performing open AI model.Your Next ChallengeYou will be a part of the core team of research and machine learning engineers working on the infrastructure, architecture, modeling and training of Olmo (Open Language Model) at all stages: pre-training, mid-training, post-training and all emerging paradigms. In this role you will be owning the design and implementation of the systems that train these models. You will be responsible for building scalable machine learning pipelines as we push the boundaries of large language modeling research. You will be collaborating with colleagues inside and outside your own team, but you are responsible for a feature or experiment from start to finish, from conception to implementation.The essential functions include, but are not limited to the following:Building infrastructure to facilitate the next generation of LLM researchOptimizing training and inference for language modelsTriaging between experiments and executing on the most impactfulSupporting and collaborating with an open-source communityBridging the gap between cutting-edge research and a widely adopted productBringing software engineering best practices to a research environmentReleasing your contributions back to the broader community in the form of open source software, model releases, and additions to Ai2's public API and open research datasets, as well as technical reportsWhat You'll NeedExpertise at building ML infrastructure - having 4+ years of industry experience building infrastructure that handles data preprocessing/transformation and model training, evaluation, inference, and deploymentDeep experience in the complete model development cycle, including data set construction, training, tuning, evaluation, performance profiling, and monitoringKnowledge of modern deep learning and natural language processing techniquesStrong software engineering skills, particularly around building performant systems and debuggingAt-home with hands-on programming – must have experience with Python and PyTorch/Jax/Tensorflow. We expect you to be the kind of engineer who can pick up a new programming language, library, or API as needed without it being a big deal.Familiarity working with cloud compute resources (e.g. AWS) and containerization (e.g. Docker)Strong collaboration and communication skills - our environment is small and collaborative, and we'd like you to thrive while working closely with others, sometimes with complementary skills/perspectivesBonus QualificationsAdvanced degree in Data Science/CS/EE/Applied Mathematics/Statistics/ML/NLP or related fields and/or relevant and equivalent engineering experienceContributions to open-source ML or research libraries (e.g. spaCy, AllenNLP, transformers)Experience successfully operating models at scale in a production settingExperience in HPC settingsCuriosity about AI researchEducationBS or MSc in Computer Science, Statistics, Engineering, Applied Mathematics, or a related quantitative fieldPhysical Demands and Work EnvironmentThe physical demands described here are representative of those that must be met by a team member to successfully perform the essential functions of this position. Reasonable accommodations may be made to enable individuals with disabilities to perform the functions.Must be able to remain in a stationary position for long periods of time.The ability to communicate information and ideas so others will understand. Must be able to exchange accurate information in these situations.The ability to observe details at close range.Can work under deadlines.A Little More About Ai2The Allen Institute for Artificial Intelligence is a non-profit research institute in Seattle founded by Paul Allen. The core mission of Ai2 is to contribute to humanity through high-impact research in artificial intelligence.In addition to Ai2's core mission, we also aim to contribute to humanity through our treatment of each member of the Ai2 Team. Some highlights are:We are a learning organization – because everything Ai2 does is ground-breaking, we are learning every day. Similarly, through weekly Ai2 Academy lectures, a wide variety of world-class AI experts as guest speakers, and our commitment to your personal on-going education, Ai2 is a place where you will have opportunities to continue learning alongside your coworkers.We value diversity – We seek to hire, support, and promote people from all genders, ethnicities, and all levels of experience regardless of age. We particularly encourage applications from women, non-binary individuals, people of color, members of the LGBTQA+ community, and people with disabilities of any kind.We value inclusion – We understand the value that people's individual experiences and perspectives can bring to an organization, and we are building a culture in which all voices are heard, respected and considered.We emphasize a healthy work/life balance – we believe our team members are happiest and most productive when their work/life balance is optimized. While we value powerful research results which drive our mission forward, we also value dinner with family, weekend time, and vacation time. We offer generous paid vacation and sick leave as well as family leave.We are collaborative and transparent – we consider ourselves a team, all moving with a common purpose. We are quick to cheer our successes, and even quicker to share and jointly problem solve our failures.We are in Seattle – and our office is on the water! We have mountains, we have lakes, we have four seasons, we bike to work, we have a vibrant theater scene, and we have so much else. We even have kayaks for you to paddle right outside our front door. We welcome interest from applicants from outside of the United States.We are friendly – chances are you will like every one of the 200+ people who work here. We do.Ai2 is proud to be an Equal Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. You may view the related Know Your Rights compliance poster and the Pay Transparency Nondiscrimination Provision by clicking on their corresponding links.This employer participates in E-Verify and will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S. If E-Verify cannot confirm that you are authorized to work, this employer is