Meta's Little Llama Model Comes With Big Benefits For Ai Researchers

SERVERS

asbe/Getty Images

Large language models have taken the tech world by storm. They power AI tools -- like ChatGPT and other conversational models, for instance -- that can solve problems, answer questions, make predictions, and more. However, using these tools come with significant risk. They've been known to make plausible-sounding but untrue declarations, generate toxic content, and mimic the bias rooted in AI training data.

To help researchers address those problems, Meta, on Friday, announced the release of a new large language model called LLaMA (Large Language Model Meta AI). The company is making it available under a noncommercial license focused on research use cases, with plans to grant access on a case-by-case basis. It will be available to academic researchers; those affiliated with organizations in government, civil society, and academia; and industry research laboratories around the world.

Also: These experts are racing to protect AI from hackers. Time is running out

What's interesting about LLaMA is that it's relatively little.

As the name suggests, large language models are pretty big. It takes huge sums of data on language (whether that's spoken language, computer code, genetic data, or other "languages") to create an AI model sophisticated enough to solve problems in that language, find answers, or generate its own compositions.

"Training smaller foundation models like LLaMA is desirable in the large language model space because it requires far less computing power and resources to test new approaches, validate others' work, and explore new use cases," Meta noted.

To train a relatively "small" LLM, Meta used "tokens" -- pieces of words, rather than whole words. Meta trained LLaMA on text from 20 languages with the most speakers, focusing on those with Latin and Cyrillic alphabets.

LLaMA is actually a collection of models, ranging from 7 billion to 65 billion parameters. LLaMA 65B and LLaMA 33B were trained with 1.4 trillion tokens, while the smallest model, LLaMA 7B, was trained on one trillion tokens. The models were trained using only publicly available datasets.

Also: How to use ChatGPT: Everything you need to know

While they are small, the LLaMA models are powerful. Meta said LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, while LLaMA-65B is competitive with the best models, Chinchilla70B and PaLM-540B.

LLaMA is also valuable to the research community as a set of foundation models. Foundation models are trained on unlabeled data, meaning they can be tailored for a wide range of use cases.

Meta will make LLaMA available at several sizes (7B, 13B, 33B, and 65B parameters) and is also sharing a LLAMA model card that details how it built the model. The company is also providing a set of evaluations on benchmarks evaluating model biases and toxicity, so that researchers can understand LLaMA's limitations and advance research in these areas.

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

SERVERS

HOT NEWS

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

Huawei S6730-H28Y4C Campus CloudEngine Switch Datasheet

S6730-H48Y6C: Unleashing Power and Flexibility for Modern Networking

CloudEngine S6730-H Series Switches Datasheet

Huawei CloudEngine Switch S6730-S24X6Q Datasheet

CloudEngine S6700 Series Switches Naming Conventions & Description

Huawei CloudEngine S6730-H24X6C Datasheet

Huawei S6730 Series Switches Datasheet

Huawei CloudEngine Switch S6730-H48X6C Datasheet

Introduction to the Huawei CloudEngine S6730-S Series Switches

Huawei S6730-H48X6CZ-V2: The Ultimate High-Speed Network Switch

Overview of the S6730-H28X6CZ-V2 Switch

Huawei CloudEngine S6730-H24X4Y4C: A High-Performance Enterprise Switch for Modern Networks

Introduction to Huawei CloudEngine S6730-H Series Switches

Comprehensive Guide to the CloudEngine S6730-H24X6C-V2: Features, Specifications, and Applications

Huawei S6730-S24X6Q: Advanced Ethernet Switch for Modern Networks

Comprehensive Guide to the S6730-H48X6C-V2 High-Performance Switch

Huawei CloudEngine S6730-H28Y4C: High-Performance Switch for Modern Networks

Overview of the S6730-H24X6C-V2

Unveiling the Huawei CloudEngine S6730 Series: Advanced Switching for Modern Networks

Huawei S6730-H48X6C: A Comprehensive Overview

Comprehensive Guide to Huawei S6730-H24X6C

Huawei Switches Visio Stencils

Meta's little LLaMA model comes with big benefits for AI researchers

See also

Hot Tags : Artificial Intelligence Innovation

Ordering Guide

Resources

About Us

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

SERVERS

HOT NEWS

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

Huawei S6730-H28Y4C Campus CloudEngine Switch Datasheet

S6730-H48Y6C: Unleashing Power and Flexibility for Modern Networking

CloudEngine S6730-H Series Switches Datasheet

Huawei CloudEngine Switch S6730-S24X6Q Datasheet

CloudEngine S6700 Series Switches Naming Conventions & Description

Huawei CloudEngine S6730-H24X6C Datasheet

Huawei S6730 Series Switches Datasheet

Huawei CloudEngine Switch S6730-H48X6C Datasheet

Introduction to the Huawei CloudEngine S6730-S Series Switches

Huawei S6730-H48X6CZ-V2: The Ultimate High-Speed Network Switch

Overview of the S6730-H28X6CZ-V2 Switch

Huawei CloudEngine S6730-H24X4Y4C: A High-Performance Enterprise Switch for Modern Networks

​Introduction to Huawei CloudEngine S6730-H Series Switches

Comprehensive Guide to the CloudEngine S6730-H24X6C-V2: Features, Specifications, and Applications

Huawei S6730-S24X6Q: Advanced Ethernet Switch for Modern Networks

Comprehensive Guide to the S6730-H48X6C-V2 High-Performance Switch

Huawei CloudEngine S6730-H28Y4C: High-Performance Switch for Modern Networks

Overview of the S6730-H24X6C-V2

Unveiling the Huawei CloudEngine S6730 Series: Advanced Switching for Modern Networks

Huawei S6730-H48X6C: A Comprehensive Overview

Comprehensive Guide to Huawei S6730-H24X6C

Huawei Switches Visio Stencils

Meta's little LLaMA model comes with big benefits for AI researchers

See also

Hot Tags : Artificial Intelligence Innovation

Ordering Guide

Resources

About Us

Introduction to Huawei CloudEngine S6730-H Series Switches