Research and Publications
A Posit8 Decompression Operator for Deep Neural Network Inference
We propose a hardware operator to decompress Posit8 representations with exponent sizes 0, 1, 2, and 3 into the IEEE 754 binary16 (FP16) representation. The motivation is to leverage the tensor units of a manycore processor that already supports FP16.32 (FP16 operands, FP32 accumulation) matrix multiply-accumulate operations for deep learning inference. According to our experiments, adding instructions to decompress Posit8 into FP16 numbers would make it possible to further reduce the memory footprint of deep neural network parameters with an acceptable loss of accuracy or precision. We present the design of our decompression operator and compare it to lookup-table implementations for the technology node of the targeted processor.
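As an illustration only, the sketch below models in software the kind of Posit8-to-FP16 decompression the abstract describes: it decodes an 8-bit posit (sign, regime, exponent, fraction) with a configurable exponent size es in 0..3. This is not the paper's hardware operator; the function name posit8_decode and the choice to decode into binary32 (which represents every Posit8 value exactly, leaving the rounding to binary16 as a final narrowing step) are assumptions made for this sketch.

```c
#include <math.h>
#include <stdint.h>
#include <stdio.h>

/* Illustrative software model, not the paper's hardware design:
 * decode an 8-bit posit with exponent size `es` (0..3) into binary32,
 * which holds every Posit8 value exactly. Rounding the result to
 * binary16 (FP16) is then a standard narrowing conversion; for es = 3
 * some values overflow or underflow the FP16 dynamic range. */
static float posit8_decode(uint8_t p, unsigned es)
{
    if (p == 0x00) return 0.0f;   /* posit zero                      */
    if (p == 0x80) return NAN;    /* NaR, mapped here to a quiet NaN */

    int sign = p >> 7;
    uint8_t body = sign ? (uint8_t)(-p) : p;  /* two's complement for negatives */

    /* Regime: run of identical bits after the sign bit. */
    int bit = 6;
    int regime_bit = (body >> bit) & 1;
    int run = 0;
    while (bit >= 0 && ((body >> bit) & 1) == regime_bit) {
        run++;
        bit--;
    }
    bit--;  /* skip the terminating (opposite) bit, if present */
    int regime = regime_bit ? run - 1 : -run;

    /* Exponent: next `es` bits; missing bits read as zero. */
    int exponent = 0;
    for (unsigned i = 0; i < es; i++) {
        exponent <<= 1;
        if (bit >= 0) exponent |= (body >> bit) & 1;
        bit--;
    }

    /* Fraction: remaining bits with an implicit leading 1. */
    float frac = 1.0f;
    float w = 0.5f;
    for (; bit >= 0; bit--, w *= 0.5f)
        if ((body >> bit) & 1) frac += w;

    float v = ldexpf(frac, regime * (1 << es) + exponent);
    return sign ? -v : v;
}

int main(void)
{
    printf("%g\n", posit8_decode(0x40, 2));  /* 1.0 for any es      */
    printf("%g\n", posit8_decode(0x60, 2));  /* 16.0 with es = 2    */
    printf("%g\n", posit8_decode(0xC0, 2));  /* -1.0 for any es     */
    return 0;
}
```

A hardware realization would instead produce the FP16 bit pattern directly from the decoded sign, scale, and fraction fields, which is where the comparison with lookup-table implementations mentioned above becomes relevant.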