Saving data as binary

Question

Adel Hafri on 14 May 2022

https://kr.mathworks.com/matlabcentral/answers/1719395-saving-data-as-binary

Commented: Walter Roberson on 20 May 2022

Basically, I have, for example, k = [0 5 4], and I want it to be saved as [0 101 100] instead of [00000000 00000101 00000100] so that it takes the least space possible. How can I do that?


Answer 1

Voss on 14 May 2022


k = [0 5 4];
arrayfun(@(x)dec2bin(x,max(1,ceil(log2(x)))),k,'UniformOutput',false)
ans = 1×3 cell array
    {'0'}    {'101'}    {'100'}


Voss on 15 May 2022

Edited: Voss on 15 May 2022


Maybe use uint8 (or uint16, uint32, uint64, depending on the range of your data, or possibly their signed counterparts int8, etc.) instead of character arrays.

Exploring the amount of storage used for various data types:

x = '00000001'; % 1-by-8 character array
whos x % 16 bytes (but could be made 8 using a different encoding)
  Name      Size            Bytes  Class    Attributes

  x         1x8                16  char               
x = '1'; % scalar character
whos x % 2 bytes (but could be 1 with different encoding)
  Name      Size            Bytes  Class    Attributes

  x         1x1                 2  char               
x = false(1,8); % 1-by-8 logical array. you might think this 
x(end) = true;  % would be 8 bits, but in fact it's 8 bytes
x
x = 1×8 logical array
   0   0   0   0   0   0   0   1
whos x % 8 bytes
  Name      Size            Bytes  Class      Attributes

  x         1x8                 8  logical              
x = true % scalar logical
x = logical
   1
whos x % 1 byte
  Name      Size            Bytes  Class      Attributes

  x         1x1                 1  logical              
x = 1; % double-precision floating point number (8 bytes)
whos x % 8 bytes
  Name      Size            Bytes  Class     Attributes

  x         1x1                 8  double              
x = uint8(1); % unsigned 8-bit integer (1 byte)
whos x % 1 byte
  Name      Size            Bytes  Class    Attributes

  x         1x1                 1  uint8              
x = [0 5 4]; % 3 doubles
whos x % 24 bytes
  Name      Size            Bytes  Class     Attributes

  x         1x3                24  double              
x = uint8(x); % 3 uint8's
whos x % 3 bytes
  Name      Size            Bytes  Class    Attributes

  x         1x3                 3  uint8              
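The same one-byte-per-value saving carries over to a file when the data is written with fwrite. Below is a minimal sketch, assuming the values fit in 0 to 255; the file name comes from tempname and the variable names are only illustrative.

```matlab
k = [0 5 4];
fname = [tempname '.bin'];                   % temporary file (assumed writable)

fid = fopen(fname, 'w');
count = fwrite(fid, k, 'uint8');             % one byte per element on disk
fclose(fid);

fid = fopen(fname, 'r');
kRead = fread(fid, [1 3], 'uint8=>uint8');   % read back as a 1-by-3 uint8 row
fclose(fid);

info = dir(fname);
nBytes = info.bytes;                         % file size in bytes
delete(fname);                               % clean up the temporary file
```

Here count is the number of elements written, and nBytes should match it because each element occupies exactly one byte.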

By the way, trying to get down to less than one byte per value, e.g., storing 1 as 1 bit and storing 4 = 100 as 3 bits, will make the resulting file impossible to decode unambiguously, because without separators the boundaries between values are lost. For instance, if your file contains the sequence of bits 1100 somewhere, you would not know whether that should be interpreted as:

1100 (i.e., decimal 12), or
110, 0 (i.e., decimal 6, 0), or
11, 0, 0 (i.e., decimal 3, 0, 0), or
1, 100 (i.e., decimal 1, 4), or
1, 10, 0 (i.e., decimal 1, 2, 0), or
1, 1, 0, 0 (i.e., decimal 1, 1, 0, 0)

All six of those interpretations use the minimum number of bits required for each decimal number (i.e., no leading zeros).

[ The other two possible interpretations:

11, 00 (i.e., decimal 3, 0), and
1, 1, 00 (i.e., decimal 1, 1, 0)

do not meet the requirement that every number is encoded with the minimum number of bits (i.e., they have leading zeros: decimal 0 is bits 00 instead of bit 0), so they could be ruled out. ]
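The count of six valid readings can be checked by brute force. The sketch below tries every way of cutting the bit string 1100 into pieces and keeps only the splits whose pieces have no leading zeros; the variable names are made up for illustration.

```matlab
bits = '1100';
n = numel(bits);
valid = {};                                   % decimal values of each valid split
for mask = 0:2^(n-1)-1                        % each mask selects a set of cut points
    cuts   = find(bitget(mask, 1:n-1));       % cut after these bit positions
    starts = [1, cuts + 1];
    stops  = [cuts, n];
    tokens = arrayfun(@(a, b) bits(a:b), starts, stops, 'UniformOutput', false);
    % A token is minimal if it is exactly '0' or it starts with '1'.
    isMinimal = cellfun(@(t) strcmp(t, '0') || t(1) == '1', tokens);
    if all(isMinimal)
        valid{end+1} = cellfun(@(t) bin2dec(t), tokens); %#ok<AGROW>
    end
end
numValid = numel(valid);                      % 6 of the 8 possible splits remain
```

Each cell of valid holds one decoding, such as [12] or [1 4], so a reader of the file has no way to pick the intended one.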

It's an interesting problem to think about:

https://en.wikipedia.org/wiki/Prefix_code

https://en.wikipedia.org/wiki/Huffman_coding
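To show why a prefix code avoids the ambiguity, here is a small sketch with a hand-made code table. The table is hypothetical and not derived from any real image: no code word is the start of another, so a decoder can read bit by bit and emit a symbol as soon as the bits seen so far match a code word.

```matlab
% Hypothetical prefix-free code table (illustrative values only).
symbols = [150 7 42];
codes   = {'0', '10', '11'};

data = [150 150 7 150 42];
[~, idx] = ismember(data, symbols);    % position of each value in the table
bitstr = [codes{idx}];                 % '0010011': 7 bits instead of 5*8 = 40

% Greedy decoding: extend the current token until it matches a code word.
decoded = [];
token = '';
for b = bitstr
    token(end+1) = b; %#ok<AGROW>
    hit = find(strcmp(token, codes));
    if ~isempty(hit)
        decoded(end+1) = symbols(hit); %#ok<AGROW>
        token = '';                    % start the next code word
    end
end
```

The decoder also needs the code table, so in practice the table (or the symbol frequencies) has to be stored alongside the bits.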

Adel Hafri on 15 May 2022

Can you please go into more detail about how exactly to use fwrite?

Here is more explanation of what I want to do:

I have 750x750 JPEG pictures with values ranging from 0 to 255, and I'm supposed to apply lossless image compression algorithms to reduce their size. Lossless compression algorithms such as Huffman coding work by shortening the codes of frequently occurring symbols. For example, if 150 is my most frequent value, my Huffman algorithm might give it the code 0, so I save 7 bits times the frequency of that value, which means compression. The problem is that MATLAB automatically turns that 1-bit 0 into 00000000, so essentially my algorithm is pointless, since MATLAB makes all the data 8 bits long again. I want a way to save data at exactly the size I choose, whether 1 bit, 2 bits, 3 bits, etc., instead of forcing all data to be 8 bits.

Here is an example of how the algorithm changes the symbols.

The picture I used isn't the best example of compression, but you get the idea.

Walter Roberson on 20 May 2022


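bits = {[1] [0 0] [1] [0 1 1]};   % variable-length code words, one cell per symbol
Bitstream = [bits{:}];            % concatenate into a single 0/1 vector
fid = fopen('test.bin','w');
% 'ubit1' writes each element as one unsigned bit; signed 'bit1' has range -1..0
fwrite(fid, Bitstream, 'ubit1');
fclose(fid);                      % the last byte is padded with zero bits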
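Because the file is padded to a whole byte, the reader needs to know how many bits are real. One possible approach, sketched below, is to store the bit count in a small header before the payload. The header layout is an assumption made for this example, not part of the original suggestion, and the sketch uses the unsigned 'ubit1' precision for both writing and reading.

```matlab
% Hypothetical round trip: write a bit vector, then read it back.
bits = uint8([1 0 0 1 0 1 1]);             % 7 bits: code words 1, 00, 1, 011
fname = [tempname '.bin'];                 % temporary file (assumed writable)

fid = fopen(fname, 'w');
fwrite(fid, numel(bits), 'uint32');        % assumed header: number of valid bits
fwrite(fid, bits, 'ubit1');                % payload, padded to a whole byte
fclose(fid);

fid = fopen(fname, 'r');
n = fread(fid, 1, 'uint32');               % recover the bit count
decoded = fread(fid, n, 'ubit1=>uint8')';  % read exactly n bits as a row vector
fclose(fid);

info = dir(fname);
fileBytes = info.bytes;                    % expected: 4 header bytes + 1 payload byte
delete(fname);                             % clean up the temporary file
```

The decoded vector still has to be split into code words, which is only unambiguous if the code words form a prefix code.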


Answer 2

Ilya Dikariev on 20 May 2022


k_new = str2num(dec2bin(k))' would do. But if you still want to reduce the size, just use dec2bin, which keeps the data in char type, which is 8 times smaller.


Walter Roberson on 20 May 2022

Edited: Walter Roberson on 20 May 2022

Only 4 times smaller: each character needs 16 bits.

If you use uint8(k_new), then that needs only one byte per value.
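The sizes discussed here can be compared directly with whos. This is a rough sketch with illustrative variable names. Note that k_new holds decimal numbers whose digits look like bits, so 101 is stored as one hundred and one, and converting the original k to uint8 gives the same one byte per value.

```matlab
k = [0 5 4];
c = dec2bin(k);              % 3-by-3 char matrix with rows '000', '101', '100'
k_new = str2num(c)';         % doubles [0 101 100]: decimal digits, not bits
k_u8  = uint8(k);            % the original values, one byte each

s = whos('c');     bytesChar   = s.bytes;   % 9 chars * 2 bytes
s = whos('k_new'); bytesDouble = s.bytes;   % 3 doubles * 8 bytes
s = whos('k_u8');  bytesUint8  = s.bytes;   % 3 values * 1 byte
```

For values above 3, the digit string has more than two digits and can exceed 255 (for example 4 becomes 100, but 8 becomes 1000), so uint8(k_new) would saturate there while uint8(k) would not.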
