
AI's multi-view wave is coming, and it will be powerful

August 25, 2023

The so-called multi-view is a way of linking two different signals by considering the information they share about the same object, despite their differences in form. Multi-view may open a path to machines that have a richer sense of the structure of the world, perhaps contributing to the goal of machines that can "reason" and "plan."
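One concrete way to link two signals by their shared information is contrastive alignment, the approach popularized by models such as OpenAI's CLIP. The article doesn't prescribe any particular method, so what follows is only a minimal sketch with hypothetical stand-in encoders: two networks embed paired views of the same object into one space, and training pulls matched pairs together while pushing mismatched pairs apart.

    import torch
    import torch.nn.functional as F

    # Hypothetical encoders: each maps one "view" of an object -- say, an
    # image and its caption -- into the same 128-dimensional embedding space.
    image_encoder = torch.nn.Linear(512, 128)  # stand-in for a vision model
    text_encoder = torch.nn.Linear(300, 128)   # stand-in for a language model

    def contrastive_loss(images, captions, temperature=0.07):
        """Pull embeddings of the same object together; push others apart."""
        za = F.normalize(image_encoder(images), dim=-1)
        zb = F.normalize(text_encoder(captions), dim=-1)
        logits = za @ zb.t() / temperature   # pairwise similarity matrix
        targets = torch.arange(len(za))      # row i is paired with column i
        return F.cross_entropy(logits, targets)

    # A batch of 16 objects, each seen from two views.
    images, captions = torch.randn(16, 512), torch.randn(16, 300)
    loss = contrastive_loss(images, captions)
    loss.backward()

The essential point is that the loss is defined across views: the network is rewarded for representing the two signals as facets of one and the same object.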

Tiernan Ray and DALL*E, ""Framed portraits of multiple views of an apple"

Artificial intelligence in its most successful form -- things like ChatGPT, or DeepMind's AlphaFold for predicting protein structures -- has been trapped in one conspicuously narrow dimension: The AI sees things from only one side, as a word, as an image, as a coordinate in space -- as any type of data, but only one at a time.

In very short order, neural networks are about to expand dramatically with a fusion of data forms that will look at life from many sides. It's an important development, for it may give neural networks greater grounding in the ways the world coheres -- the ways things hold together -- which could be a key stage in the movement toward programs that can one day perform what you would call "reasoning" and "planning" about the world.

Also: Meta unveils 'Seamless' speech-to-speech translator

The coming wave of multi-sided data has its roots in years of study by machine learning scientists, and generally goes by the name of "multi-view" or, alternatively, data fusion. There's even an academic journal dedicated to the topic, called Information Fusion, published by scholarly publishing giant Elsevier.

Data fusion's profound idea is that anything in the world one is trying to examine has many sides to it at once. A web page, for example, has both the text you see with the naked eye, and the anchor text that links to that page, or even a third thing, the underlying HTML and CSS code that is the structure of the page. 

An image of a person can have both a label for the person's name, and also the pixels of the image. A video has a frame of video but also the audio clip accompanying that frame. 

Today's AI programs treat such varying data as separate pieces of information about the world, with little to no connection between them. Even when neural nets handle multiple kinds of data, such as text and audio, the most they do is process those data sets simultaneously -- they don't explicitly link multiple kinds of data with an understanding that they are views of the same object. 

For example, Meta Platforms -- owner of Facebook, Instagram, and WhatsApp -- on Tuesday unveiled its latest effort in machine translation, a tour de force in using multiple modalities of data. The program, SeamlessM4T, is trained on speech data and text data at the same time, and can take either speech or text as input and produce either as output across its translation tasks.

But SeamlessM4T doesn't perceive each unit of each signal as a facet of the same object. 

Also: Meta's AI image generator says language may be all you need

That fractured view of things is beginning to change. In a paper published recently by New York University assistant professor and faculty fellow Ravid Shwartz-Ziv, and Meta's chief AI scientist, Yann LeCun, the duo discuss the goal of using multi-view to enrich deep learning neural networks by representing objects from multiple perspectives. 

Objects are fractured into unrelated signals in today's deep neural networks. The coming wave of multi-modality, employing images plus sounds plus text plus point clouds, graph networks, and many other kinds of signals, may begin to put together a richer model of the structure of things.

Tiernan Ray and DALL*E, "An apple looking at its reflection in a large, square mirror with an elegant gilded frame."

In the highly technical, and rather theoretical paper, posted on the arXiv pre-print server in April, Shwartz-Ziv and LeCun write that "the success of deep learning in various application domains has led to a growing interest in deep multiview methods, which have shown promising results."

Multi-view is heading toward a moment of destiny, as today's increasingly large neural networks -- such as SeamlessM4T -- take on more and more modalities, known as "multi-modal" AI.  

Also: The best AI chatbots of 2023: ChatGPT and alternatives

The future of so-called generative AI, programs such as ChatGPT and Stable Diffusion, will combine a plethora of modalities into a single program, including not only text and images and video, but also point clouds and knowledge graphs, even bio-informatics data, and many more views of a scene or of an object.  

The many different modalities offer potentially thousands of "views" of things, views that may share mutual information -- a potentially very rich way of understanding the world. But the approach also raises challenges.

The key to multi-view in deep neural networks is a concept that Shwartz-Ziv and others have long studied, known as the "information bottleneck." The information bottleneck becomes problematic as the number of modalities expands.

An information bottleneck is a key concept in machine learning. In the hidden layers of a deep network, the thinking goes, the input of the network is stripped down to only those elements essential for predicting the network's output -- a form of compression followed by decompression.
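In its classic formulation, due to Naftali Tishby and colleagues -- work Shwartz-Ziv himself helped extend to deep networks -- the bottleneck is a trade-off between two mutual-information terms: compress the input X into a representation Z, while preserving what Z says about the output Y. In the usual notation:

    \min_{p(z \mid x)} \; I(X; Z) \;-\; \beta \, I(Z; Y)

Here I(·;·) is mutual information, and the coefficient β sets how much task-relevant information is worth retaining at the cost of a less compressed representation.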

Tiernan Ray and DALL*E, "glass bottle lying on its side, side view"+"multiple apples"+"green apple"+"and there is another apple made of green translucent glass to the right of the bottle"

In an information bottleneck, multiple inputs are combined in a "representation" that extracts the salient details shared by the inputs as different views of the same object. In a second stage, that representation is then pared down to a compressed form that contains only the essential elements of the input necessary to predict an output that corresponds to that object. That process of amassing mutual information, and then stripping away or compressing all but the essentials, is the bottleneck of information.
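One way the two-view case has been formalized in the multi-view information-bottleneck literature -- a formalization in the same spirit as, though not necessarily identical to, the one in the Shwartz-Ziv and LeCun paper -- is to ask that a representation Z1 of view V1 keep the information V1 shares with the other view V2, and discard what is specific to V1 alone:

    \min_{p(z_1 \mid v_1)} \; I(Z_1; V_1 \mid V_2) \;-\; \lambda \, I(Z_1; V_2)

The conditional term penalizes view-specific detail; the second term rewards retaining what the views hold in common -- precisely the "mutual information" stage described above.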

The challenge for multi-view in large multi-modal networks is how to know what information from all the different views is essential for the many tasks that a giant neural net will perform with all those different modalities. 

Also: You can build your own AI chatbot with this drag-and-drop tool

As a simple example, a neural network performing a text-based task such as ChatGPT, producing sentences of text, could break down when it has to also, say, produce images, if the details relevant for the latter task have been discarded during the compression stage. 
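A toy illustration of that failure mode, on entirely made-up data: if the bottleneck keeps only the input dimensions that matter for task A, a readout trained for task B has nothing left to work with.

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=(1000, 4))   # a 4-dimensional input "signal"
    task_a = x[:, 0] + x[:, 1]       # task A depends on dimensions 0 and 1
    task_b = x[:, 2] + x[:, 3]       # task B depends on dimensions 2 and 3

    # A "bottleneck" tuned only for task A keeps just the dimensions
    # that predict A, discarding the rest.
    z = x[:, [0, 1]]

    # A linear readout recovers task A almost perfectly...
    coef_a, *_ = np.linalg.lstsq(z, task_a, rcond=None)
    print("task A error:", np.abs(z @ coef_a - task_a).mean())  # ~0.0

    # ...but the information task B needs is gone for good.
    coef_b, *_ = np.linalg.lstsq(z, task_b, rcond=None)
    print("task B error:", np.abs(z @ coef_b - task_b).mean())  # ~1.1

No amount of training downstream of the bottleneck can recover what the compression stage threw away.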

As Shwartz-Ziv and LeCun write, "[S]eparating information into relevant and irrelevant components becomes challenging, often leading to suboptimal performance."

There's no clear answer yet to this problem, the scholars declare. It will require further research -- in particular, redefining multi-view from a formulation that covers only two different views of an object to one that can handle many.

"To ensure the optimality of this objective, we must expand the multiview assumption to include more than two views," they write. In particular, the traditional approach to multi-view assumes "that relevant information is shared among all different views and tasks, which might be overly restrictive," they add. It might be that views share only some information in some contexts. 

Also: This is how generative AI will change the gig economy for the better

"As a result," they conclude, "defining and analyzing a more refined version of this naive solution is essential."

No doubt, the explosion of multi-modality in practice will push the science of multi-view to devise new solutions -- and, in turn, new theoretical breakthroughs for AI.
