Files
Download Full Text (745 KB)
Keywords
compression, tabular data, huffman coding, n-grams
Abstract
Read file and rearrange its contents to a json file that sorts data by column.
Implement Huffman coding using the bitarray package’s canonical Huffman function.
Use the Huffman dictionaries for each column to encode all values.
Compress each line individually using zstandard
BYU ScholarsArchive Citation
Callaway, Maren and Piccolo, Stephen R., "Custom compression algorithm shows potential to reduce tabular data by a magnitude of 17 using Huffman coding and n-grams" (2023). Library/Life Sciences Undergraduate Poster Competition 2023. 57.
https://scholarsarchive.byu.edu/library_studentposters_2023/57
Document Type
Poster
Publication Date
2023-03-02
Language
English
College
Life Sciences
Department
Biology
Copyright Use Information
https://lib.byu.edu/about/copyright/