And with csv you just gotta pray that you're parser parses the same as their writer..and that their writer was correctly implemented..and they set the settings correctly
Hmm, not so sure. He produced a digital signal, who's spectrogram happened to be an image, and then played that digital signal to a bird. Dunno if a analogue spectrogram really even makes sense as a concept. The only analogue part of the chain would be the birds vocalisations, right?
Yup, that's what I was alluding to, while it may not still be the case for transistors, they did manage to take 50 odd years to get there, push that trend line from the figure 50 years heh (not saying you should, 5 seems much more conservative)
Can you go into a bit more details on why you think these papers are such a home run for your point?
Where do you get 95% from, these papers don't really go into much detail on human performance and 95% isn't mentioned in either of them
These papers are for transformer architectures using next token loss. There are other architectures (spiking, tsetlin, graph etc) and other losses (contrastive, RL, flow matching) to which these particular curves do not apply
These papers assume early stopping, have you heard of the grokking phenomenon? (Not to be confused with the Twitter bot)
These papers only consider finite size datasets, and relatively small ones at that. I.e. How many "tokens" would a 4 year old have processed? I imagine that question should be somewhat quantifiable
These papers do not consider multimodal systems.
You talked about permeance, does a RAG solution not overcome this problem?
I think there is a lot more we don't know about these things than what we do know. To say we solved it all 2-5 years ago is, perhaps, optimistic
Something makes me uneasy about this being a Google sheet, you need to use credentials to view it and someone has a log of who has accessed it..you can probably even see who's viewing it in realtime
Use an anonymous account! Or someone should host this on a website or something with higher privacy
which Blockchain are we talking here? How does it compare to the current banking infrastructure?
again, which one? How does it compare to the current pricing?
escrow is a thing, someone can build up a PayPal equivalent on top of a Blockchain, the list goes on
the current system doesn't do great here, some Blockchains makes it way more traceable, in fact
skill issue, but also solvable with a PayPal equivalent
not a fact, what does this even mean?
does it?
You could say the Linux kernel is an astronomically terrible idea because it doesn't do anything...but it is just the platform, the good comes from what people build on top of it that add all these quality of life features you miss
I don't really follow your logic, how else would you propose to shape the audio that is not "just an effect".
Your analogy to real life does not take into account that the audio source itself is moving, so their is an extra variable outside of just stereo signal -which is what spatial audio is modelling
And your muffling example sounds a bit over simplified maybe? My understanding is that the spatial stuff is produced by phase shifting the LR signals slightly
Finally why not go further? "I don't listen to speaker audio because it's all just effects and mirages to sound like a real sound, what only 2^16 discrete positions the diaphragm can be in" :p
Nahh your nitpicking there, large csvs are gonna be compressed anyways
In practice I've never met a Json I cant parse, every second csv is unparseable