Sunday 9 September 2007 — This is more than 17 years old. Be careful.
Discussion of what size external disk drive to buy (I’m thinking of a 500Gb model to put an end to data squeeze for a good long time) lead to a question about the meaning of the SI prefix tera-. Wikipedia’s SI prefix page explains that the prefixes started from Greek roots for large things, then switched over to numbers as sources:
- giga- (G, 109): From the Greek root for giant.
- tera- (T, 1012): From the Greek root for monster.
- peta- (P, 1015): Since tera- looks like tetra- (Greek for four with a letter missing), peta- is from penta-, Greek for five with a letter missing.
- exa- (E, 1018): From the Greek prefix hexa- (six), with a letter missing.
- zetta- (Z, 1021): From the Latin septum (seven), with the p dropped, and the first letter changed from s to avoid confusion with existing SI symbols.
- yotta- (Y, 1024): From the Greek octo- (eight), with the c dropped and a y added to avoid having a symbol of O, indistinguishable from zero.
The official Bureau International de Poids et Mesures page about the prefixes is interesting because of its links to the actual resolutions that created the prefixes. Unlike ISO, BIPM manages to make their official output refreshingly brief: their resolutions are only a few paragraphs at most.
One aspect of this I don’t understand: why the BIPM approves new prefixes in such small increments. Peta- and exa- appeared in 1975, and zetta- and yotta- in 1991. Why dole them out so slowly? Of course, there are proposals, including one that goes all the way to 1063 (luma-).
The scientists take exception to the computer geeks using their decimal prefixes for binary amounts. 220 is not 106, and the inaccuracy gets quite large by the time you are discussing exabytes of data. There is a system of binary prefixes (gibibytes, anyone?), but they have not caught on yet, and sound silly, so I don’t think they ever will.
Comments
465 * 2^30 == 499,289,948,160
- Paddy.
True. And 2^20 isn't, either.
They're becoming more and more standard in operating systems and programs that deal with large amounts of data, for exactly the reason you state: the difference between power-of-two prefix and power-of-ten prefix gets large enough that the difference is confusing to the user. Consequently, I expect they'll catch on to the extent that the user encounters them in use, just as users originally learned the large power-of-ten prefixes.
Add a comment: