Files

Download

Download Full Text (745 KB)

Keywords

compression, tabular data, huffman coding, n-grams

Abstract

Read file and rearrange its contents to a json file that sorts data by column.

Implement Huffman coding using the bitarray package’s canonical Huffman function.

Use the Huffman dictionaries for each column to encode all values.

Compress each line individually using zstandard

Document Type

Poster

Publication Date

2023-03-02

Language

English

College

Life Sciences

Department

Biology

University Standing at Time of Publication

Senior

Custom compression algorithm shows potential to reduce tabular data by a magnitude of 17 using Huffman coding and n-grams

Included in

Life Sciences Commons

Share

COinS