Research Scientist – Large Language Models
il y a 6 jours
Join our research team to solve information extraction 🙂 Recent PhD required You need to be an ML, NLP, and LLM expert We are looking for a Research Scientist out of PhD to create LLMs & VLMs such as NuExtract and NuMarkdown to power the https://nuextract.ai/ platform. Your job will involve creating datasets, training LLMs, performing experiments / ablation studies, and so on. Check the list of typical topics below. You will join a team of brilliant ML scientists supervised by our CEO (https://www.linkedin.com/in/etiennebcp/). We are a 3-years-old AI startup with 12 employees located in Station F, Paris. We did YCombinator. We have a hybrid work model -- you should be able to work from our office regularly (at least once a week). Requirements You should be out of PhD or post-doc. You should have an ML/NLP/LLM background. You should be self-driven, creative, passionate about ML/NLP/LLMs. You should have both a researcher and a hacker/builder mindset. You should like to work in a startup environment (fast pace, frequent changes of directions). Responsibilities Training task-specific LLMs Running experiments/ablation studies Developing software related to LLMs Staying up to date with relevant LLM & NLP research Typical R&D topics we are working on 1. Extraction Confidence Users ofNuExtract.ai want to be able to quickly verify the validity of extracted values in the JSON output. To do so, they need to know which values NuExtract is confident about, and which ones it is not. We want to figure out how we can get an uncertainty score for the extraction values of NuExtract. This is not trivial due to multiplicity of correct answers and correlations between answers. Users ofNuExtract.ai want to be able to quickly verify the validity of extracted values. To do so, they need to know where, in the document, the information is coming from (or deduced from). We want to figure out how to do this. 3. Long Document Extraction LLMs have a limited context length which limits document size. We want to figure out how NuExtract could extract information from documents much longer than its context length. 4. Reasoning for Structured Extraction We want to train NuExtract able to reason via private chain of thoughts about its extraction. 5. Extraction Agent We want to provide a reasoning NuExtract the ability of using tools (e.g. zooming on document or performing a web search) in order to improve extraction quality. 6. Structured Extraction Benchmark There is no public benchmark for structured extraction. We want to create such benchmark and make it public. Links Platform: https://nuextract.ai/ GitHub: https://github.com/numindai Discord: https://discord.com/invite/3tsEtJNCDe NuNER paper: https://arxiv.org/abs/2402.15343 Referrals increase your chances of interviewing at NuMind (YC S22) by 2x Get notified about new Research Scientist jobs in Paris, Île-de-France, France. #J-18808-Ljbffr
-
Research Scientist, Open-Weights LLM Pretraining
il y a 1 semaine
Paris, France DeepMind Temps pleinA leading AI research organization in Paris is seeking a passionate Research Scientist to drive advancements in large language models (LLMs). This role involves developing and evaluating next-gen models, collaborating with experts across teams, and contributing to projects that significantly impact Google products and services. To succeed, candidates should...
-
Research Scientist, Pretraining, Gemma
il y a 1 semaine
Paris, France DeepMind Temps pleinSnapshot Come join our team focused on advancing the state‑of‑the‑art open‑weights large language models (LLMs). You will conduct cutting‑edge research particularly in the multimodal domain (speech, vision and text) with a direct path to impacting billions of users through Google products. About Us At Google DeepMind we are a team of scientists,...
-
Research Scientist
il y a 2 semaines
Paris, France Meta Temps plein**Research Scientist Responsibilities**: - Lead research to advance the science and technology of intelligent machines - Lead research that enables learning the semantics of data (images, video, text, audio, speech and other modalities) - Devise better data-driven models of human behavior - Work towards long-term ambitious research goals, while identifying...
-
Research Scientist, Nlp
il y a 2 semaines
Paris, France Meta Temps plein**Research Scientist, NLP Responsibilities**: - Lead research that extends the capabilities of foundation models through reasoning and the use of tools - Work towards long-term ambitious research goals, while identifying intermediate milestones. - Influence progress of relevant research communities by producing publications. - Contribute research that can...
-
Research Scientist Lead, Nlp
il y a 2 semaines
Paris, France Meta Temps plein**Research Scientist Lead, NLP Responsibilities**: - Lead research that extends the capabilities of foundation models through reasoning and the use of tools - Shape and work towards long-term ambitious research goals, while identifying intermediate milestones. - Influence progress of relevant research communities by producing publications. - Contribute...
-
AI Research Scientist
il y a 1 semaine
Paris, France Lexsi Labs Temps pleinJoin to apply for the AI Research Scientist (PhD) role at Lexsi Labs The Lexsi Lab is a premier research lab dedicated to solving one of the most critical challenges of our time: ensuring that advanced artificial intelligence systems are safe, transparent, and aligned with human values. Founded by pioneers in the deep learning space, our lab operates at the...
-
Ai - Research Scientist (Leadership)
il y a 6 jours
Paris, France Meta Temps plein**AI - Research Scientist (Leadership) Responsibilities**: - Influence and drive strategy for long-term ambitious research to advance the science and technology of intelligent machines - Form internal partnerships, influence, resolve conflict, build alignment, and work effectively across teams and disciplines throughout the company to execute the strategy -...
-
AIML - ML Researcher, Foundation Models
il y a 1 semaine
Paris, France Apple Inc. Temps pleinParis, Ile-de-France, France Machine Learning and AIWe are a group of engineers and researchers responsible for building foundation models at Apple. We build infrastructure, datasets, and models with fundamental general capabilities such as understanding and generation of text, images, speech, videos, and other modalities and apply these models to Apple...
-
AIML - ML Research Engineer, Foundation Models
il y a 1 semaine
Paris, France Apple Temps pleinOverview We are a group of engineers and researchers responsible for building foundation models at Apple. We build infrastructure, datasets, and models with fundamental general capabilities such as understanding and generation of text, images, speech, videos, and other modalities and apply these models to Apple products! Description We believe that the most...
-
Software Engineer, Language
il y a 2 semaines
Paris, France Meta Temps plein**Software Engineer, Language - Generative AI Responsibilities**: - Collaborate, and execute on research that pushes forward the state of the art in responsible large language model research - Directly contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results - Develop methodology...