3 hours ago by svat
If you've never seen it before, this is a beautiful blog post on related matters: "Optimising pointer subtraction with 2-adic integers" http://blog.sigfpe.com/2010/05/optimising-pointer-subtractio...
2 hours ago by kevinventullo
A few months back I wrote a blog post very much inspired by Danās post, though a little more academic: https://kevinventullo.com/2020/12/21/2-adic-logarithms-and-f...
The idea is to exploit deeper structures in p-adic number theory to develop useful and novel algorithms with unsigned integer arithmetic, e.g. fast integer exponentiation. Feedback welcome!
(Warning: math heavy)
an hour ago by thechao
> I don't know how gcc generates its approximate 2-adic reciprocals. Possibly it uses something based on the Euclidean GCD algorithm. I wasn't able to find the precise line of source in a reasonable time.
It's 0x1FFF...F / 7 = 0x249...249249249.
Just about the oddest thing I've seen; I guess, if I were the author, it'd be my "1 in 10000" day?
29 minutes ago by xyzzy_plugh
Makes more sense in binary:
0xFFF..F 111111111111111111111111
0x7 000000000000000000000111
0x249..9 001001001001001001001001
3 hours ago by chrchang523
See also https://libdivide.com/ , when you need to repeatedly divide by a constant that isn't known until runtime.
2 hours ago by tragomaskhalos
Genuine question: why would any compiler bother to optimise division with such obtuse dividends, other than a purists' satisfaction ?
an hour ago by Denvercoder9
There's no specific optimisation for these specific numbers. There's a more general optimisation to turn division by a fixed number into a multiplication and shift where possible (because division is a lot slower than multiplication), and there is another optimisation that eliminates useless shifts. Combined they yield this result for these specific numbers.
7 minutes ago by joe_the_user
Yeah, a more detailed description of this general method seems missing from the article. Anyone care to provide it?
2 minutes ago by chrchang523
See the "Labor of Division" blog posts by the author of libdivide: https://ridiculousfish.com/blog/posts/labor-of-division-epis...
2 hours ago by warkdarrior
Division is orders of magnitude slower than multiplication.
an hour ago by Tuna-Fish
Less than one order of magnitude these days. On Tiger Lake, integer multiply is 3 cycles, while integer division is 12 for 32-bit integers and 15 for 64-bit.
Of course, the multiply is fully pipelined while the division is only partially, but even that leaves only a 10-fold difference.
Division is much faster these days than it used to be. It's still worth it to have compiler optimizations for it, but it's not important to avoid it anymore.
an hour ago by martincmartin
They're often special cases of general algorithms. You can do many divisions as combinations of multiplications & shifts; in this case, the resulting code simplifies down to the smallest possible.
an hour ago by andrewfromx
i'd like to find these numbers for base9 vs base10. see https://stackoverflow.com/questions/67226727/is-there-a-way-... if you look at that sample golang code, is that the right way to go about this? Or is there a much simpler way already built into a high level language like go?
18 minutes ago by AnotherGoodName
I'll run through a base 2 example and you should be able to extrapolate for any number
eg. find the equivalent multiplier of 1/31 in 8bits. This must exist because 31 has co-prime factors to 2^8 (the rules of a solvable diophantine require co-prime factors and we use that below).
Now the goal is to find a number that when multiplied by 31 and mod 2^8 = 1. This will give us a multiplicative equivalent to 1/31.
We can use the diophantine equation. 31x - 256y = 1. This equation asks 'what multiplied by 31 and subtracted by any number of 2^8 gives us 1?'. ie. It's asking what multiplier = 1/31 % 2^8
(I'm not going to work through the diophantine equation here but it's a log(n) equation and easy to do but I'm lazy so using wolfram alpha which has the diophantine equation built in)
Solving diophantine(31x - 256y = 1) tells us 'x must be of the form 256n + 233'. So (256n + 223)*31 = 1 under modulo 256
So now we know whenever we see a division by 31 we can multiply by 223 (or 256n + 223 if you really want a larger multiplier) and we'll get a number that's equivalent to 1/31 under modulo 2^8. You can do the above process for any number under any base as long as the number is co-prime to the base.
3 hours ago by jvickers
In JavaScript (specifically node / V8) if dividing multiple values by the same divisor, is it fastest to calculate the reciprocal and then multiply it?
How about other languages (and compiler systems)?
Are there binary tricks worth knowing about to quickly calculate reciprocals?
2 hours ago by brrrrrm
It's typically faster to calculate a reciprocal (especially at compile time) and then multiply it, but this is often at the expense of precision. In C++, for example, the -O3 flag on gcc will still emit a division (divss) to preserve compliant floating point precision in this example: https://godbolt.org/z/PY3Wjno4P
However, when we enable `-ffast-math`, we permit the compiler to do a bit more aggressive optimization in this domain (trading off precision), and we see multiplication (mulss) is emitted: https://godbolt.org/z/KEb1nc43z
3 hours ago by layoutIfNeeded
>I call the corresponding divisors ideal
Not a particularly good name, as āidealā already has an established meaning in algebra: https://en.m.wikipedia.org/wiki/Ideal_(ring_theory)
41 minutes ago by withinboredom
This is computer science, not algebra.
8 minutes ago by Tainnor
algebra plays a huge role in CS, e.g. cryptography
3 hours ago by omgJustTest
Floating point, while convenient does have substantial drawbacks in the implementations of circuits that we have today. Efficiency gains like this somewhat expose facets that are normally hidden. It will be interesting to see the transition to either analog NN or quantum computers, which inherently have better representations of continuous values.
Edit: Additionally the serial vs parallel processing paradigm shift is a main advantage of the switch too! Queued systems seem to have severe scheduling limitations, which the real-world does not always inherently have.
3 hours ago by svat
The linked article is about integer division and has nothing to do with floating point, and in fact I can't see how this idea could be applied to floating-point arithmetic. Can it?
2 hours ago by wizzwizz4
Not directly, no. The underlying mathematics kind of can, but you'd need dedicated circuitry (or bit-hacking) for it because of the floating point parts, and it would fall over around subnormals.
3 hours ago by whalesalad
I have been seeing this Wordpress theme a lot lately and it is so bad the way the content is pushed so far to the right. On my 27" monitor the left margin of the text is essentially the center of the display.
3 hours ago by jlarocco
I have a 27" monitor also, and it looks fine to me.
2 hours ago by tzs
How about if the browser window is maximized?
What I get in that case is 23 cm for the darkish blue left sidebar and 36 cm for the light blue right area that contains the article. Within that right area, the article is in 19 cm white rectangle with just under 2 cm margins.
So, starting from the left here is what I see:
15 cm empty dark blue background,
6 cm of sidebar text on dark blue background,
2 more cm of empty dark blue background,
2 cm of empty light blue background,
A tad under 2 cm of empty white background,
About 14.5 cm of text on white background,
A tad under 2 cm of empty white background,
15.5 cm of empty light blue background.
It really is quite ugly. It looks like something I would do because I completely suck at CSS.
28 minutes ago by dhosek
I almost neverĀ¹ run a browser maximized on anything bigger than my 16" laptop screen (and even that not so much). Most of the time, I have browser windows that are 1/2 (on my 24" monitor) or 1/3 (on my ultrawide monitor) of the width and the full height (2/3 width on my laptop)).
1 The exception being browser windows to run sites like Netflix where I'm really using it as a video interface.
3 hours ago by tachyonbeam
27" monitor on Chrome here. Looks pretty far to the right.
Daily digest email
Get a daily email with the the top stories from Hacker News. No spam, unsubscribe at any time.