KingRandomGuy

@ KingRandomGuy @lemmy.world

Posts

0
Comments

109
Joined

2 yr. ago

8mo ago

Deleted

Permanently Deleted

KingRandomGuy @lemmy.world 8mo ago

> All of the “AI” garbage that is getting jammed into everything is merely scaled up from what has been before. Scaling up is not advancement.
I disagree. Scaling might seem trivial now, but the state-of-the-art NLP architectures of a decade ago (LSTMs) could not scale to the degree that our current methods can. Designing new architectures that perform better on GPUs (such as attention-based models and Mamba) is a legitimate advancement. Moreover, the viability of this level of scaling wasn't really understood until phenomena like double descent (in which test error surprisingly goes down, rather than up, once model complexity increases past a certain point) were discovered.
Lots of advancements were also necessary to train deep networks at all. Better optimizers like Adam instead of pure SGD, along with tricks like residual connections and batch normalization, were all needed to scale up even small ConvNets, because they work around issues such as vanishing gradients and covariate shift that tend to appear when naively training deep networks.
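To make the vanishing gradient point concrete, here's a toy sketch (scalar "layers", not a real network; the layer form, the weight of 0.5 and the depth of 50 are all assumptions picked for illustration). It multiplies the per-layer derivatives through a deep stack, with and without an identity skip path:

```python
import math

def gradient_through_stack(depth, w=0.5, x=1.0, residual=False):
    """Return d(output)/d(input) for `depth` stacked toy scalar layers.

    Plain layer:    y = w * tanh(x)
    Residual layer: y = x + w * tanh(x)
    """
    grad = 1.0
    for _ in range(depth):
        # Derivative of w * tanh(x) with respect to x.
        local = w * (1.0 - math.tanh(x) ** 2)
        if residual:
            local += 1.0  # the identity path contributes a derivative of 1
            x = x + w * math.tanh(x)
        else:
            x = w * math.tanh(x)
        grad *= local  # chain rule: multiply the per-layer derivatives
    return grad

plain = gradient_through_stack(50)
skip = gradient_through_stack(50, residual=True)
print(f"plain: {plain:.3e}, residual: {skip:.3e}")
```

In the plain stack every factor is at most 0.5, so the product collapses toward zero. With the skip path every factor is at least 1, so the gradient survives the full depth.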

8mo ago

Beyond RGB: A new image file format efficiently stores invisible light data

I agree that pickle works well for storing arbitrary metadata, but my main gripe is that there's no exact standard for how the metadata should be formatted. For FITS, for example, there are keywords for metadata such as the row order, CFA matrices, etc. that all FITS processing and display programs need to follow to read the image properly. So to make working with multi-spectral data easier, it'd definitely be helpful to have a standard set of keywords and a standard encoding format.
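As a rough sketch of what I mean (the keyword names below are made up for illustration and aren't taken from FITS or any published standard), pickle will happily round-trip any dict, so nothing stops two writers from using incompatible keys unless readers check against an agreed set:

```python
import pickle

# Hypothetical keyword set, invented for this example only.
REQUIRED_KEYS = {"row_order", "cfa_pattern", "wavelengths_nm"}

def missing_keys(metadata):
    """Return the required keywords absent from a metadata dict, sorted."""
    return sorted(REQUIRED_KEYS - set(metadata))

# pickle round-trips whatever dict it is given, complete or not.
ad_hoc = pickle.loads(pickle.dumps({"bands": [450, 550, 650], "order": "top-down"}))
standard = pickle.loads(pickle.dumps({
    "row_order": "top-down",
    "cfa_pattern": "RGGB",
    "wavelengths_nm": [450, 550, 650],
}))

print(missing_keys(ad_hoc))
print(missing_keys(standard))
```

The first dict carries the same information under different names, and a reader expecting the agreed keywords can't use it without guessing.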

It would be interesting to see if photo editing software will pick up multichannel JPEG. As of right now there are very few sources of multi-spectral imagery for consumers, so I'm not sure what the target use case would be. The closest thing I can think of is narrowband imaging in astrophotography, but normally you process those images in dedicated astronomy software (e.g. Siril, PixInsight), though you can also re-combine different wavelengths in traditional image editors.

I'll also add that HDF5 and Zarr are good options to store arrays in Python if standardized metadata isn't a big deal. Both of them have the benefit of user-specified chunk sizes, so they work well for tasks like ML where you may have random accesses.
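To illustrate why the chunk size matters for random access, here's a library-free sketch that just does the index arithmetic (the dataset shape and both chunk layouts are hypothetical, and this isn't the HDF5 or Zarr API). It counts how many chunks a read of a single sample overlaps:

```python
def chunks_touched(index_ranges, chunk_shape):
    """Count chunks overlapped by a read.

    index_ranges: one half-open (start, stop) pair per axis.
    chunk_shape:  chunk length along each axis.
    """
    count = 1
    for (start, stop), size in zip(index_ranges, chunk_shape):
        first = start // size        # chunk index holding the first element
        last = (stop - 1) // size    # chunk index holding the last element
        count *= last - first + 1
    return count

# Hypothetical dataset: 10,000 images of 256 x 256 pixels with 8 spectral bands.
# Read one randomly chosen image (index 4242) in full.
one_sample = [(4242, 4243), (0, 256), (0, 256), (0, 8)]

per_sample = chunks_touched(one_sample, chunk_shape=(1, 256, 256, 8))
tiled = chunks_touched(one_sample, chunk_shape=(100, 32, 32, 8))
print(per_sample, tiled)
```

With one image per chunk, a random sample is a single chunk read. With the tiled layout the same read touches many chunks, and each of those chunks also holds tiles from 99 other images, so far more data gets read than is needed.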
