Big Data & Data Mining
Oct 24, 2023
Reduce Apache Spark ML Compute Costs with New Algorithms in Spark RAPIDS ML Library
Spark RAPIDS ML is an open-source Python package enabling NVIDIA GPU acceleration of PySpark MLlib. It offers PySpark MLlib DataFrame API compatibility and...
8 MIN READ
Jun 12, 2023
Distributed Deep Learning Made Easy with Spark 3.4
Apache Spark is an industry-leading platform for distributed extract, transform, and load (ETL) workloads on large-scale data. However, with the advent of deep...
7 MIN READ
Jun 02, 2023
GPU Integration Propels Data Center Efficiency and Cost Savings for Taboola
When you see a context-relevant advertisement on a web page, it's most likely content served by a Taboola data pipeline. As the leading content recommendation...
13 MIN READ
Feb 08, 2023
Categorical Features in XGBoost Without Manual Encoding
XGBoost is a decision-tree–based, ensemble machine learning algorithm based on gradient boosting. However, until recently, it didn’t natively support...
5 MIN READ
Dec 05, 2022
Scraping Real-Estate Sites for Data Acquisition with Scrapy
Data is one of the most valuable assets that a business can possess. It sits at the core of data science and data analysis: without data, they’re both...
13 MIN READ
Jul 29, 2022
Evaluating Data Lakes and Data Warehouses as Machine Learning Data Repositories
Data is the lifeblood of modern enterprises, whether you’re a retailer, financial service company, or digital advertiser. Across industries, organizations are...
11 MIN READ
Jun 29, 2022
Improving Enterprise IT Fraud Prevention
Any business or industry, from retail and healthcare to financial services, is subject to fraud. The cost of fraud can be staggering. Every $1 of fraud loss...
7 MIN READ
Apr 12, 2021
Cloudera and NVIDIA Collaborate to Accelerate Data Analytics and AI at Scale
Data engineering and data science workflows are often limited by the ability of platforms to process massively growing amounts of data. The integration of the...
4 MIN READ
Mar 19, 2021
Power Your Big Data Analytics with the Latest NVIDIA GPUs in the Cloud
Dask is an accessible and powerful solution for natively scaling Python analytics. Using familiar interfaces, it allows data scientists familiar with PyData...
2 MIN READ
Jan 24, 2018
Oil Giant Launches Supercomputer to Analyze Subsoil Data
The Italian multinational oil giant Eni deployed a 18.6 petaflops GPU-accelerated supercomputer, making it the most powerful industrial system in the world....
2 MIN READ
May 03, 2017
Developer Spotlight: Applying Deep Learning to Aerospace Technologies and Integrated Systems
Vivek Venugopalan, a staff research scientist at the United Technologies Research Center (UTRC) shares how they are using deep learning and GPUs to understand...
1 MIN READ
Apr 19, 2017
Scientists Capture First Image of a Black Hole
Astronomers from around the world pointed their powerful telescopes towards a supermassive black hole that lies in the center of the Milky Way (nearly 26,000...
3 MIN READ
Mar 31, 2017
Sorting Shopping Lists with Artificial Intelligence
Instacart, an Internet-based grocery delivery service, shares how they are using deep learning to help their tens of thousands personal shoppers be more...
1 MIN READ
Mar 23, 2017
AI System Helps Detect and Manage Traffic Incidents
Iowa State University researchers are developing a deep learning-based system to help the Iowa Department of Transportation improve incident detection and...
2 MIN READ
Mar 02, 2017
AI System Beats Pros at Texas Hold’em
A team of researchers from University of Alberta, Charles University in Prague and Czech Technical University developed an AI system called DeepStack that...
2 MIN READ
Feb 21, 2017
AI Helps Autonomous Vehicles Locate Themselves
Researchers at the NYU Tandon School of Engineering are developing an artificial intelligence system for autonomous vehicles that links them to HERE Live Map...
2 MIN READ