Page 33 - EE Times Europe Magazine | April2019
P. 33

EE|Times EUROPE   33

                                                         Top 10 Processors for AI Acceleration at the Endpoint


        (MMA), which can achieve 8 TOPS.                      resources to save power and cost.
          The SoC can handle a video stream from a front-mounted camera   The KL520 is available now and can also be found on an accelerator
        at up to 8 MP or a combination of four to six 3-MP cameras plus radar,   card from manufacturer AAEON (the M2AI-2280-520).
        LiDAR, and ultrasonic sensors. The MMA might be used to perform
        sensor fusion on these inputs in an automated valet parking system,   Gyrfalcon Lightspeeur 5801
        for example.                                          Designed for the consumer electronics market, Gyrfalcon’s Lightspeeur
          The TDA4VM is designed for ADAS designs between 5 and 20 W. The   5801 offers 2.8 TOPS at 224-mW power consumption (the equivalent of
        device is in pre-production, but development kits are available now.  12.6 TOPS/W) with 4-ms latency. Gyrfalcon uses a processor-in-memory
                                                              technique that is particularly power-efficient compared with other archi-
        GPU                                                   tectures. Power consumption can actually be traded off with clock speed
        Nvidia Corp. Jetson Nano                              by varying the clock speed between 50 and 200 MHz. Lightspeeur 5801
        Nvidia’s well-known Jetson Nano is a small but powerful graphics   contains 10 MB of memory, so entire models can fit on the chip.
        processing unit (GPU) module for AI applications in endpoint devices.   The part is the company’s fourth production chip and is already
        Built on the same Maxwell architecture as larger members of the Jetson   found in LG’s Q70 mid-range smartphone, where it handles inference
                                           family (AGX Xavier   for camera effects. A USB thumb drive development kit, the 5801 Plai
                                           and TX2), the GPU   Plug, is available now.
                                           on the Nano module
                                                              ULTRA-LOW-POWER
                                           Nvidia’s Jetson    Eta Compute ECM3532
                                           Nano module        Eta Compute’s first production product, the ECM3532, is designed for
                                           houses a powerful   AI acceleration in battery-powered or energy-harvesting designs for
                                           GPU with 128 cores   IoT. Always-on applications in image processing and sensor fusion can
                                           for AI at the edge.   be achieved with a power budget as low as 100 µW.
                                           (Image: Nvidia Corp.)  The chip has two cores — an Arm Cortex-M3 microcontroller core
                                                              and an NXP CoolFlux DSP. The company uses a proprietary voltage and
        has 128 cores and is capable of 0.5 TFLOPS, enough to run multiple   frequency scaling technique, which adjusts every clock cycle, to wring
        neural networks on several streams of data from high-resolution image   every last drop of power out of both cores. Machine-learning workloads
        sensors, according to the company. It consumes as little as 5 W when in   can be processed by either core (some voice workloads, for example, are
        use. The module also features a quad-core Arm Cortex-A57 CPU.  better-suited to the DSP).
          Like other parts in Nvidia’s range, the Jetson Nano uses CUDA X,   Samples of the ECM3532 are available now, and mass production is
        Nvidia’s collection of acceleration libraries for neural networks. Inex-  expected to start in the second quarter.
        pensive Jetson Nano development kits are widely available.
                                                              Syntiant Corp. NDP100
        CONSUMER CO-PROCESSORS                                U.S. startup Syntiant’s NDP100
        Kneron Inc. KL520                                     processor is designed for
        The first offering from American-Taiwanese startup Kneron is the   machine-learning inference on
        KL520 neural network processor, designed for image processing and   voice commands in applications
        facial recognition in applications such as smart homes, security sys-  in which power is tight. Its
        tems, and mobile devices. It’s optimized to run convolutional neural   processor-in-memory–based
        networks (CNNs), the type commonly used in image processing today.  silicon consumes less than 140 µW
          The KL520 can run 0.3 TOPS and consumes 0.5 W (equivalent to   of active power and can run models
        0.6 TOPS/W), which the company said is sufficient for accurate facial   for keyword spotting, wake word
        recognition, given that the chip’s MAC efficiency is high (over 90%).   detection, speaker identification, or
        The chip architecture is reconfigurable and can be tailored to differ-  event classification.   Syntiant’s NDP100 is de-
        ent CNN models. The company’s complementary compiler also uses   The product will be used to enable   signed for voice processing
        compression techniques in order to run bigger models within the chip’s   hands-free operation of consumer   in ultra-low-power applica-
                                                              devices such as earbuds, hearing   tions. (Image: Syntiant Corp.)
                                                              aids, smartwatches, and remote
                                                              controls, according to Syntiant.
                                                                Development kits are available now.
                                                              GreenWaves Technologies GAP9
                                                              GAP9, the first ultra-low-power application processor from French
                                                              startup GreenWaves, has a powerful compute cluster of nine RISC-V
                                                              cores whose instruction set has been heavily customized to optimize
                                                              the power consumed. It features bidirectional multi-channel audio
                                                              interfaces and 1.6 MB of internal RAM.
                                                                GAP9 can handle neural network workloads for images, sounds, and
                                                              vibration sensing in battery-powered IoT devices. GreenWaves’ figures
                                                              have GAP9 running MobileNet V1 on 160 × 160 images, with a channel
                                                              scaling of 0.25 in just 12 ms and with a power consumption of
        Kneron’s KL520 uses a reconfigurable architecture and clever com-  806 µW/frame/second. ■
        pression to run image processing in mobile and consumer devices.
        (Image: Kneron Inc.)                                  Sally Ward-Foxton is a staff correspondent at AspenCore.

                                                                                       www.eetimes.eu | APRIL 2020
   28   29   30   31   32   33   34   35   36   37   38