Sample: scan
Minimum spec: SM 2.0

This example demonstrates an efficient CUDA implementation of parallel prefix sum, also known as "scan".  Given an array of numbers, scan computes a new array in which each element is the sum of all the elements before it in the input array.

Key concepts:
Data-Parallel Algorithms
Performance Strategies
