Croissant: a metadata format for ML-ready datasets

Posted by Omar Benjelloun, Software Engineer, Google Research, and Peter Mattson, Software Engineer, Google Core ML and President, MLCommons Association Machine learning (ML) practitioners looking to reuse existing datasets to train an ML model often spend a lot of time understanding the data, making sense of its organization, or figuring out what subset to use […]

Croissant: a metadata format for ML-ready datasets Lire l’article »

Social learning: Collaborative learning with large language models

Posted by Amirkeivan Mohtashami, Research Intern, and Florian Hartmann, Software Engineer, Google Research Large language models (LLMs) have significantly improved the state of the art for solving tasks specified using natural language, often reaching performance close to that of people. As these models increasingly enable assistive agents, it could be beneficial for them to learn

Social learning: Collaborative learning with large language models Lire l’article »

Health-specific embedding tools for dermatology and pathology

Posted by Dave Steiner, Clinical Research Scientist, Google Health, and Rory Pilgrim, Product Manager, Google Research There’s a worldwide shortage of access to medical imaging expert interpretation across specialties including radiology, dermatology and pathology. Machine learning (ML) technology can help ease this burden by powering tools that enable doctors to interpret these images more accurately

Health-specific embedding tools for dermatology and pathology Lire l’article »

Chain-of-table: Evolving tables in the reasoning chain for table understanding

Posted by Zilong Wang, Student Researcher, and Chen-Yu Lee, Research Scientist, Cloud AI Team People use tables every day to organize and interpret complex information in a structured, easily accessible format. Due to the ubiquity of such tables, reasoning over tabular data has long been a central topic in natural language processing (NLP). Researchers in

Chain-of-table: Evolving tables in the reasoning chain for table understanding Lire l’article »

Talk like a graph: Encoding graphs for large language models

Posted by Bahare Fatemi and Bryan Perozzi, Research Scientists, Google Research Imagine all the things around you — your friends, tools in your kitchen, or even the parts of your bike. They are all connected in different ways. In computer science, the term graph is used to describe connections between objects. Graphs consist of nodes

Talk like a graph: Encoding graphs for large language models Lire l’article »

Cappy: Outperforming and boosting large multi-task language models with a small scorer

Posted by Yun Zhu and Lijuan Liu, Software Engineers, Google Research Large language model (LLM) advancements have led to a new paradigm that unifies various natural language processing (NLP) tasks within an instruction-following framework. This paradigm is exemplified by recent multi-task LLMs, such as T0, FLAN, and OPT-IML. First, multi-task data is gathered with each

Cappy: Outperforming and boosting large multi-task language models with a small scorer Lire l’article »

L’équité en santé : un nouveau cadre pour évaluer les performances de l’apprentissage automatique

L’équité en santé est une préoccupation sociétale majeure dans le monde, les disparités ayant de nombreuses causes, notamment des limitations d’accès aux soins de santé, des différences de traitement clinique et même des différences fondamentales dans les technologies de diagnostic. En dermatologie, par exemple, les résultats du cancer de la peau sont plus mauvais pour

L’équité en santé : un nouveau cadre pour évaluer les performances de l’apprentissage automatique Lire l’article »

MELON: Reconstructing 3D objects from images with unknown poses

Posted by Mark Matthews, Senior Software Engineer, and Dmitry Lagun, Research Scientist, Google Research A person’s prior experience and understanding of the world generally enables them to easily infer what an object looks like in whole, even if only looking at a few 2D pictures of it. Yet the capacity for a computer to reconstruct

MELON: Reconstructing 3D objects from images with unknown poses Lire l’article »

SCIN: A new resource for representative dermatology images

Posted by Pooja Rao, Research Scientist, Google Research Health datasets play a crucial role in research and medical education, but it can be challenging to create a dataset that represents the real world. For example, dermatology conditions are diverse in their appearance and severity and manifest differently across skin tones. Yet, existing dermatology image datasets

SCIN: A new resource for representative dermatology images Lire l’article »

Retour en haut