Friday, February 27, 2015

Microsoft to accelerate Bing search with neural network


When we search Google’s web index, we are only searching around 10 percent of the half-a-trillion or so pages that are potentially available. Much of the content in the larger deep web (not to be confused with the dark web) is buried further down in the sites that make up the visible surface web. The indexes of competitors like Yahoo and Bing (around 15 billion pages each) are still only half as large as Google’s. To close this gap, Microsoft has recently pioneered sophisticated new Field-Programmable Gate Array (FPGA) technology to make massive web crawls faster and more efficient.
Google’s engineers have previously estimated that a typical 0.2-second web query reflects a quantity of work spent in indexing and retrieval equal to about 0.0003 kWh of energy per search. With over 100 billion searches per month against their petabyte-scale index, well-executed page ranking has become a formidable proposition. Microsoft’s approach with Bing has been to break the ranking portion of search into three parts: feature extraction, free-form expressions, and machine learning scoring.
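To put the quoted energy figure in perspective, here is a back-of-the-envelope calculation in Python. It uses only the two numbers given above (0.0003 kWh per search and 100 billion searches per month) and treats the monthly volume as a lower bound; the function names are made up for illustration.

```python
# Rough estimate built only from the two figures quoted in the article.
KWH_PER_SEARCH = 0.0003        # quoted energy estimate per search
SEARCHES_PER_MONTH = 100e9     # "over 100 billion" per month, used as a lower bound
JOULES_PER_KWH = 3.6e6         # unit conversion: 1 kWh = 3.6 million joules


def monthly_energy_gwh(kwh_per_search: float, searches: float) -> float:
    """Total monthly energy in gigawatt-hours (1 GWh = 1e6 kWh)."""
    return kwh_per_search * searches / 1e6


def joules_per_search(kwh_per_search: float) -> float:
    """Energy of a single search expressed in joules."""
    return kwh_per_search * JOULES_PER_KWH


if __name__ == "__main__":
    gwh = monthly_energy_gwh(KWH_PER_SEARCH, SEARCHES_PER_MONTH)
    joules = joules_per_search(KWH_PER_SEARCH)
    print(f"roughly {gwh:.0f} GWh per month, about {joules:.0f} J per search")
```

Under these assumptions the total comes to roughly 30 GWh per month, or a little over a kilojoule per query, which is why even modest efficiency gains in ranking hardware matter at this scale.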

Bing’s document selection service, which retrieves and filters the documents containing search terms, still runs on Xeon processors. Its ranking services, which score the filtered documents according to their relevance to the query, have recently been ported to an FPGA-based system Microsoft calls Project Catapult. Microsoft could likely afford custom ASICs (application-specific integrated circuits) to accelerate Bing’s ranking functions. But given the speed at which the software algorithms now change, it probably can’t afford to give up the reprogrammability that FPGA hardware provides.
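The division of labor described here (selection first, then a ranking stage split into feature extraction, free-form expressions, and machine learning scoring) can be sketched in a few lines of Python. This is only a toy illustration of the data flow, not Bing’s implementation: the features, the derived expression, the linear model, and its weights are all invented for the example.

```python
from typing import Dict, List, Tuple

Features = Dict[str, float]

# Hypothetical weights standing in for a trained model.
WEIGHTS: Features = {"term_hits": 0.5, "doc_length": 0.0, "hit_density": 10.0}


def select_documents(query: str, corpus: Dict[str, str]) -> List[str]:
    """Selection: keep ids of documents that contain every query term."""
    terms = query.lower().split()
    return [doc_id for doc_id, text in corpus.items()
            if all(term in text.lower().split() for term in terms)]


def extract_features(query: str, text: str) -> Features:
    """Ranking stage 1: compute simple raw features (illustrative only)."""
    words = text.lower().split()
    hits = sum(words.count(term) for term in query.lower().split())
    return {"term_hits": float(hits), "doc_length": float(len(words))}


def free_form_expressions(features: Features) -> Features:
    """Ranking stage 2: derive new features by combining the raw ones."""
    derived = dict(features)
    length = features["doc_length"]
    derived["hit_density"] = features["term_hits"] / length if length else 0.0
    return derived


def score(features: Features) -> float:
    """Ranking stage 3: a linear model as a stand-in for the learned scorer."""
    return sum(WEIGHTS.get(name, 0.0) * value for name, value in features.items())


def rank(query: str, corpus: Dict[str, str]) -> List[Tuple[str, float]]:
    """Run selection, then the three ranking stages, best score first."""
    scored = []
    for doc_id in select_documents(query, corpus):
        features = free_form_expressions(extract_features(query, corpus[doc_id]))
        scored.append((doc_id, score(features)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

In the arrangement the article describes, only the three ranking functions would run on the FPGA, while something like `select_documents` would stay on the Xeon host.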
Traditionally, FPGAs have been the go-to device for very specific computing needs. Because their internal structure can be easily reconfigured, they’re frequently used for prototyping processors. They’re also handy for applications that need a large number of input or output connections on the chip. But there is another place where they are used no matter the cost: wherever absolute speed is of the essence. For example, if your device needs to calculate the total energy of all the hits to a massive satellite-based cosmic-ray detecting scintillator array, decide which hits are real, and do it all in a few nanoseconds, software simply isn’t up to the job.
Project Catapult was originally based on a PCI Express card design from 2011 that used six Xilinx FPGAs linked with a controller. Integrating new devices into Microsoft’s existing servers, however, required several redesigns to adhere to strict limits on how much power the devices could draw and how much heat they could dissipate. The latest design uses a Stratix V GS D5 FPGA from Altera. For the hardcore FPGA crowd, this particular device has 1,590 digital signal processing blocks, 2,014 M20K memory blocks, and thirty-six 14.1 Gb/s transceivers. As the Bing team announced last June at ISCA 2014, this platform enabled ranking with roughly half the number of servers used before.
The term Microsoft is using here is “convolution neural network accelerator.” Convolution is commonly used in signal processing applications like computer vision and speech recognition, or anywhere special averaging or cross-correlation would be of service. In computer vision, for example, 2D convolution can be used to massage each pixel using information from its immediate neighbors to achieve various filtering effects. Convolutional neural networks (CNNs) are composed of small assemblies of artificial neurons, each of which focuses on just a small part of an image, known as its receptive field. CNNs have already bested humans in classifying objects in challenges like the ImageNet 1000. Classifying documents for ranking is a similar problem, and it is now one among many that Microsoft hopes to address with CNNs.
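As a concrete picture of what “massaging each pixel using information from its immediate neighbors” means, here is a minimal pure-Python sketch of 2D convolution. It is written for clarity rather than speed, computes only the positions where the kernel fits entirely inside the image, and uses a 3×3 averaging kernel chosen purely as an example of a filtering effect.

```python
from typing import List

Matrix = List[List[float]]


def convolve2d(image: Matrix, kernel: Matrix) -> Matrix:
    """Convolve image with kernel, keeping only fully overlapping positions."""
    kernel_h, kernel_w = len(kernel), len(kernel[0])
    image_h, image_w = len(image), len(image[0])
    # True convolution flips the kernel in both axes before sliding it.
    flipped = [row[::-1] for row in kernel[::-1]]
    output = []
    for r in range(image_h - kernel_h + 1):
        out_row = []
        for c in range(image_w - kernel_w + 1):
            acc = 0.0
            # Weighted sum over the neighborhood covered by the kernel.
            for i in range(kernel_h):
                for j in range(kernel_w):
                    acc += image[r + i][c + j] * flipped[i][j]
            out_row.append(acc)
        output.append(out_row)
    return output


# Example kernel: a 3x3 box blur that replaces each pixel with its local mean.
BOX_BLUR: Matrix = [[1 / 9] * 3 for _ in range(3)]

if __name__ == "__main__":
    image = [
        [0, 0, 0, 0],
        [0, 9, 9, 0],
        [0, 9, 9, 0],
        [0, 0, 0, 0],
    ]
    for row in convolve2d(image, BOX_BLUR):
        print(row)
```

Each output value depends only on a small window of the input, which is the same locality that the receptive field of a CNN neuron expresses. Because the arithmetic is so regular, it also maps naturally onto the DSP blocks of an FPGA.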
As we speak, Microsoft’s engineers are looking to start using Altera’s new Arria 10 FPGA. This chip is optimized for the kinds of floating-point-intensive operations that were traditionally the province of DSPs. It is able to run at teraflop speeds with three times the energy efficiency of a comparable GPU, and Microsoft hopes it will help the company make significant gains in the search-and-rank business.
