Solar Energy News  
ROBO SPACE
More-flexible machine learning
by Staff Writers
Boston MA (SPX) Oct 07, 2015


Flickr users tagged a photograph similar to this one "architecture," "tourism," and "travel." A machine-learning system that used a novel training strategy developed at MIT proposed "sky," "roof," and "building"; when it used a conventional training strategy, it came up with "art," "sky," and "beach." Image courtesy MIT News.

Machine learning, which is the basis for most commercial artificial-intelligence systems, is intrinsically probabilistic. An object-recognition algorithm asked to classify a particular image, for instance, might conclude that it has a 60 percent chance of depicting a dog, but a 30 percent chance of depicting a cat.

At the Annual Conference on Neural Information Processing Systems in December, MIT researchers will present a new way of doing machine learning that enables semantically related concepts to reinforce each other. So, for instance, an object-recognition algorithm would learn to weigh the co-occurrence of the classifications "dog" and "Chihuahua" more heavily than it would the co-occurrence of "dog" and "cat."

In experiments, the researchers found that a machine-learning algorithm that used their training strategy did a better job of predicting the tags that human users applied to images on the Flickr website than it did when it used a conventional training strategy.

"When you have a lot of possible categories, the conventional way of dealing with it is that, when you want to learn a model for each one of those categories, you use only data associated with that category," says Chiyuan Zhang, an MIT graduate student in electrical engineering and computer science and one of the new paper's lead authors.

"It's treating all other categories equally unfavorably. Because there are actually semantic similarities between those categories, we develop a way of making use of that semantic similarity to sort of borrow data from close categories to train the model."

Zhang is joined on the paper by his thesis advisor, Tomaso Poggio, the Eugene McDermott Professor in the Brain Sciences and Human Behavior, and by his fellow first author Charlie Frogner, also a graduate student in Poggio's group. Hossein Mobahi, a postdoc in the Computer Science and Artificial Intelligence Laboratory, and Mauricio Araya-Polo, a researcher with Shell Oil, round out the paper's co-authors.

Close counts
To quantify the notion of semantic similarity, the researchers wrote an algorithm that combed through Flickr images identifying tags that tended to co-occur - for instance, "sunshine," "water," and "reflection." The semantic similarity of two words was a function of how frequently they co-occurred.

Ordinarily, a machine-learning algorithm being trained to predict Flickr tags would try to identify visual features that consistently corresponded to particular tags. During training, it would be credited with every tag it got right but penalized for failed predictions.

The MIT researchers' system essentially gives the algorithm partial credit for incorrect tags that are semantically related to the correct tags. Say, for instance, that a waterscape was tagged, among other things, "water," "boat," and "sunshine." With conventional machine learning, a system that tagged that image "water," "boat," "summer" would get no more credit than one that tagged it "water," "boat," "rhinoceros." With the researchers' system, it would, and the credit would be a function of the likelihood that the tags "summer" and "sunshine" co-occur in the Flickr database.

The problem is that assigning partial credit involves much more complicated calculations than simply scoring predictions as true or false. How, for instance, does a system that gets none of the tags completely right - say, "lake," "sail," and "summer" - compare to one that makes only one enormous error - say, "water," "boat," and "rhinoceros"?

To perform this type of complicated evaluation, the researchers use a metric called the Wasserstein distance, which is a way of comparing probability distributions. That would have been prohibitively time-consuming even two years ago, but in 2014, Marco Cuturi of the University of Kyoto and Arnaud Doucet of Oxford University proposed a new algorithm for calculating the Wasserstein distance more efficiently.

The MIT researchers believe that their paper is the first to use the Wasserstein distance as an error metric in supervised machine learning, where the system's performance is gauged against human annotations.

Human error
In experiments, the researchers' system outperformed a conventional machine-learning system even when the criterion of success was simply predicting the tags that Flickr users had applied to a given image. But the difference was even more acute when the criterion of success was the prediction of tags that were semantically similar to those applied by Flickr users.

That may sound circular: A system that factors in semantic similarity is better at predicting semantic similarity. But when a Web user is trying to find images online, a general thematic correspondence may well be more important than a precise intersection of keywords.

Moreover, the tags that users assign to any given Flickr image can be a motley assortment. Automatically generated tags clustered according to semantic similarity could be more useful than those applied by humans. One image in the researchers' test set, for instance, depicted a uniformed mountain biker wearing a crash helmet biking down a hilly trail.

The actual tags were "spring," "race," and "training." But the trees in the image are bare, the grass is brown, and the tags "race" and "training" can't both be right. The researchers' system came up with "road," "bike," and "trail"; the conventional machine-learning algorithm produced "dog," "surf," and "bike."

Finally, if some other measure of the notion of semantic similarity proved better able to capture human intuition than co-occurrence of Flickr tags, then the MIT researchers' system could simply adopt it instead.

Indeed, a longstanding and ongoing project in artificial-intelligence research is the assembly of "ontologies" that relate classification terms hierarchically - dogs are animals, collies are dogs, Lassie was a collie. In future work, the researchers hope to test their system using ontologies standard in machine-vision research.


Thanks for being here;
We need your help. The SpaceDaily news network continues to grow but revenues have never been harder to maintain.

With the rise of Ad Blockers, and Facebook - our traditional revenue sources via quality network advertising continues to decline. And unlike so many other news sites, we don't have a paywall - with those annoying usernames and passwords.

Our news coverage takes time and effort to publish 365 days a year.

If you find our news sites informative and useful then please consider becoming a regular supporter or for now make a one off contribution.
SpaceDaily Contributor
$5 Billed Once


credit card or paypal
SpaceDaily Monthly Supporter
$5 Billed Monthly


paypal only


.


Related Links
Massachusetts Institute of Technology
All about the robots on Earth and beyond!






Comment on this article via your Facebook, Yahoo, AOL, Hotmail login.

Share this article via these popular social media networks
del.icio.usdel.icio.us DiggDigg RedditReddit GoogleGoogle

Previous Report
ROBO SPACE
U.S. Navy orders new robots, servicing
Bedford, Mass. (UPI) Oct 6, 2015
iRobot has received contracts from the U.S. Naval Surface Warfare Center for support services for MK1 robots in use and for production of new units. The two multi-year indefinite-delivery/indefinite-quantity contracts have a combined ceiling value of $96 million. Initial initial orders worth $7.9 million have already been made. "iRobot values its long standing service to multiple ... read more


ROBO SPACE
Study: Africa's urban waste could produce rural electricity

Researchers create inside-out plants to watch how cellulose forms

Microalgae biomass as feedstock for biofuel, food, feed and more

Barley straw shows potential as transport biofuel raw material

ROBO SPACE
More-flexible machine learning

Psychic robot will know what you really meant to do

Bio-inspired robotic finger looks, feels and works like the real thing

U.S. Navy orders new robots, servicing

ROBO SPACE
Adwen and IWES sign agreement for the testing of 8MW turbine

US has fallen behind in offshore wind power

Moventas rolls out breakthrough up-tower planetary repairs for GE fleet

Chinese firm invests in Mexican wind power projects

ROBO SPACE
Scandal-hit VW slams brakes on investment

China auto sales in first rise for 6 months: industry group

VW to recall nearly 2,000 cars in China amid scandal

Dirt-cheap catalyst may lower fuel costs for hydrogen-powered cars

ROBO SPACE
Knit it, braid it, turn it on and use it!

New Oregon approach for 'nanohoops' could energize future devices

Superconductivity trained to promote magnetization

A necklace of fractional vortices

ROBO SPACE
Contract on Construction of Jordan NPP by Russia Likely Within 2 Years

Abu Dhabi to Invest in Russia's Nuclear Projects, Agriculture Sector

Risk of cyber attack on global nuclear facilities growing

Bolivia signs nuclear agreement with Russia's Rosatom

ROBO SPACE
EDF for carbon price floor

Shift from fossil fuels risks popping 'carbon bubble': World Bank

DOE selects UC Berkeley to lead US-China energy and water consortium

Now 'right moment' for carbon tax: IMF chief

ROBO SPACE
Extreme Amazon weather could have global climate consequences

Smithsonian scientists say vines strangle carbon storage in tropical forests

Broadleaf trees show reduced sensitivity to global warming

Study reveals answers for managing Guam's threatened native trees









The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.