Woah, thank you for using my little web app in this video! I've fixed a few bugs since this video came out, but if anyone has any issues feel free to hit me up!
@jordoneaton7083
3 жыл бұрын
Yes, where can I find this app?
@jaczob666
3 жыл бұрын
@@jordoneaton7083 Description man directmusic.me/wav2png/
@jordoneaton7083
3 жыл бұрын
@@jaczob666 Thank you. My screen narrator has been glitchy lately and appears to have missed that.
@tjwebb7428
3 жыл бұрын
Do you have this up on GitHub or anywhere?
@himagnamukherjee9382
3 жыл бұрын
You really have to make this a VST
@axman6815
3 жыл бұрын
Ah, music to my eyes 😅
@ygnoen6685
3 жыл бұрын
Cursed
@Euphoric_Brunette
3 жыл бұрын
Now we can say "I can see music notes"
@kuzmannymusic
3 жыл бұрын
🤣
@hirano9383
3 жыл бұрын
@@Euphoric_Brunette perfect pitch In the eyes
@Nugginsworth
3 жыл бұрын
OW IT SOUNDS SO BRIGHT
@dylanlockemp3
3 жыл бұрын
this reminds me of throwing pngs into serum wavetable
@noface718
3 жыл бұрын
Or harmor
@Shrek_Has_Covid19
3 жыл бұрын
poo
@ig9te
3 жыл бұрын
Hello Dylan
@Spherey
3 жыл бұрын
how this website actually works is related to the way the golden record from voyager 1 and 2 works. how it’s related is because both ways the record’s encoded image audio and the way the image is converted into sound uses the same encoding technique. i used to have a hyper-fixation over this, which is how i know how it worked. let’s say you input an image with a height of 432 pixels. the waveform that it outputs is actually divided into 432 parts, with each part corresponding to one pixel row of the image. how the converter encodes each part of the waveform (which corresponds to one pixel row of the image as i said earlier) is by using the crests (aka peaks or high parts) and troughs (aka valleys or low parts) of the waveform as different brightnesses. crests corresponding to lighter colors of each pixel row, and troughs corresponding to darker colors of each pixel row. so the converter scans through each row of the image left-to-right and outputs them as one part of the waveform. the converter scans through the image and outputs them as a waveform until it completely finishes generating. this is how the images are converted into waveforms.
@RubyPiec
3 жыл бұрын
I put random images into Audacity
@cerulity32k
3 жыл бұрын
PNG and WAV files are probably the best formats and they are my favorite for image and sound. WAV format is just uncompressed bytes of sound. PNG uses 4 bytes of data per pixel (RGBA), and usually WAV uses 4 bytes characters as far as I know, so it's perfect conversion.
@elvanaslan4435
3 жыл бұрын
can we get a round of applause for the editing in this video!
@RegahP
3 жыл бұрын
You should've tried changing the hue of the image
@adicsbtw
3 жыл бұрын
I think that the best explanation I have seen is that it reads left to right top to bottom, red channel is probably the left channel and green is the right channel. That would make the most sense to me. That would explain the popping sounds, the color of the image, and also means it would be hard to use image editing software to actually edit it due to the way it is formatted Edit: This is almost definitely how it works. If it was formatted differently that would make editing it much easier
@Villfuk02
3 жыл бұрын
The images are read pixel by pixel from left to right, top to down, like when reading text. This means that when you stretch something to be two lines instead of one, you repeat it twice. If you wanted to stretch a sound to be twice as long, assuming it takes up only one line, you have to make the line twice as wide, keeping the first half where it is and wrapping the second half onto the next line. Let me illustrate with text. this line contains a snare: ___SNARE__ stretching it vertically gives you two of them: ___SNARE__ ___SNARE__ stretching how I described it: _____SSNN_ _AARREE___
@Jopulis
3 жыл бұрын
Ooh, yeah... I feel like the colors mean frequencies or something about the waveform played at that particular time, like dark = a low sound, bright = a high sound, but that doesn't make sense when there are multiple frequencies playing at once...
@Villfuk02
3 жыл бұрын
@@Jopulis It's actually just the individual samples of the sound, left/right channel as red, the other channel as green. So the frequency is how fast light and dark colours alternate. And the amplitude (volume) is the difference in their brightness.
@RedstoneMiner18
2 жыл бұрын
Hmm, Intersting
@molly-molly925
2 жыл бұрын
𝙎 𝙉 𝘼 𝙍 𝙀
@ZethKeeper
3 жыл бұрын
I can easily imagine Andrew Huang making music with that.
@goodsoup9895
3 жыл бұрын
This video was made with *red heart emoji*
@elliotsmelliot
3 жыл бұрын
it really was made with [ *red heart emoji* ] and it shows 😍
@EricE549
3 жыл бұрын
now i have some sounds to use in my bandcamp experimental album!
@cyantasks7129
3 жыл бұрын
4:20 (not intended) that would make a good sound for like a machine gun.
@DaniSC_l1
3 жыл бұрын
now you can save music to paper!
@ncndemonplayz4859
3 жыл бұрын
You gotta drop the full release of the first finished product at the end that was actually sounding good 🙌
@lonergothonline
3 жыл бұрын
have you found out about blob opera yet? I spent a couple days going through a bunch of covers people made with the 'experiment'. its an a.i powered choir.
@nikolasg5520
3 жыл бұрын
this could be used to hide a message in an ARG :D .
@raoufbensalem3417
3 жыл бұрын
That what i was thinking XD
@Kai_On_Paws_4298
2 жыл бұрын
Gave me an idea
@kwasinimako
3 жыл бұрын
Nobody: 3:39 Travis Scott: thats fire 🔥🔥
@TheSoundFXGuy
3 жыл бұрын
I use png to wav to add a visual watermark to my sound effects in the same way Mick Gordon made the pictures in the Doom soundtrack.
@MrGreenAKAguci00
3 жыл бұрын
You are crazy. I'm here for it.
@ITSH4WK
3 жыл бұрын
My new favorite genre is Gaussian Blur
@jummy0
3 жыл бұрын
other people have mentioned this, but the program just reads each pixel's brightness (left to right, then top to bottom) as a single sample, at 44100 Hz (44100 pixels per second). it's a lot more interesting to use the image editor as a synth rather than a filter, by creating the images from scratch. granted, this can lead to a lot harsher noise if you mess up, but you can make some really cool sounds out of it. just make the image width the inverse of the frequency you want (44100 Hz sample rate / 200 pixels width = about 220 Hz per line), draw some patterns in black and white, then blur it to act as a lowpass filter to avoid killing your ears
@jummy0
3 жыл бұрын
additionally, if your image editor has a "Make Tileable" filter, it will remove any buzzing between cycles [lines] and popping between loops.
@justcama
3 жыл бұрын
What the fuck, this is so cool!
@tim_means_heart
3 жыл бұрын
- Hey man, what's your DAW ? - Have you heard of MS Paint
@jaykay3561
3 жыл бұрын
You're a legend, you should make a skillshare course because you're amazing at this! I'd love to learn from you!
@FatherSonAndAlcohol
3 жыл бұрын
This is awesome and why are the images always orange???
@genericname3685
3 жыл бұрын
So this is what they mean by hearing images. Thank you sir
@hyphinx
3 жыл бұрын
i did this to one of my pngs and it electrified me lol
@silly_lil_guy
3 жыл бұрын
14:05 when you accidentally create an EATEOT track
@BooToob
3 жыл бұрын
I did this and I did content aware fill on a part of the png I erased and you could here other parts of the song blended in. It was super interesting.
@offchristianamr
3 жыл бұрын
this is such a brilliant idea! that’s so sick omg
@migats2160
3 жыл бұрын
I can clearly make some interesting effect with this
@michaelduff2382
3 жыл бұрын
I have that same shirt.... i get compliments on it every time i wear it. So... nice shirt! Lol
@thehonestdude1067
3 жыл бұрын
An assault both on the eyes and the ears. Magnificent 😂😂
@889.
3 жыл бұрын
we have glitchpop in our hands
@ultrablack7271
3 жыл бұрын
Could you try to use the best samples of your experiment here, to make some music? Would be interesting ;)
@Jobo47
3 жыл бұрын
that first guitar loop is can't love by trippie red 🤯
@Beatsbasteln
3 жыл бұрын
damn son!
@virus_iv3001
3 жыл бұрын
pretty cool
@nikolaudio
3 жыл бұрын
What happens if you were to flip an entire png horizontally what it would do to the sound?
@krawieck
3 жыл бұрын
10:08 this sounds very moist
@Zuion_Art
3 жыл бұрын
When he blur the image, I was like... "Hmm if I put video quality down to 144p It would probably become 144p HD"
@airfryer2793
6 ай бұрын
He should look into making ocsiloscope music.
@robertstroup3699
3 жыл бұрын
Now that you have tried PNGs you need to try PBNJs
@angusphilippe8789
3 жыл бұрын
wait thats the sample used in cant love by trippie redd
@gonza9467
3 жыл бұрын
great video!
@Eliqzar
3 жыл бұрын
Instant LoFi PNG
@astedroid
Жыл бұрын
now: make entire song using photo manipulation
@4uartaOnda
3 жыл бұрын
10:51 Minecraft's cave sounds...
@frogfan449
3 жыл бұрын
merzbow would love this website
@NealMiskinMusic
3 жыл бұрын
What happens when you make it not orange? What does purple sound like?
@NealMiskinMusic
3 жыл бұрын
Messing with the colours makes it sound Bit crushed. That makes sense.
@sergejsdarznieks321
3 жыл бұрын
i made a whole beat with this in ableton live 10 lite!
@nicnakpattywhack5784
3 жыл бұрын
making music using a minecraft world downlaod
@fungalwater3175
3 жыл бұрын
Me waiting for something interesting to happen ( = w = )
@nonetrix3066
3 жыл бұрын
Upscaling the audio with waifu2x was interesting
@raoufbensalem3417
3 жыл бұрын
? ??
@nonetrix3066
3 жыл бұрын
@@raoufbensalem3417 What do you not understand?
@raoufbensalem3417
3 жыл бұрын
@@nonetrix3066 what is waifu2x ?
@nonetrix3066
3 жыл бұрын
@@raoufbensalem3417 AI image up-scaling
@snadwichbcuzyes3359
3 жыл бұрын
it sounds like it would be something that plays when you enter a broken area in a video game-
@Shrek_Has_Covid19
3 жыл бұрын
print out the png then scan it and convert that to a wav
@SuperMarioSmellMyFinger
3 жыл бұрын
I wish I could try that but I'm on a IPad and it doesn't support my browser
@PnfrlEnm
3 жыл бұрын
I believe the way it works is each pixel's brightness represents the amplitude of a sample of audio, and it reads left to right like a book, so when he's copying and pasting vertical layers, theoretically it would be like repeating a line of text, and that's why it chops the sample rather than blurs it. I could be wrong though, but it makes the most sense to me. Edit: got further into the video, that also explains the distortion effect, because with more contrast, bright pixels get brighter and dark pixels get darker, which should also stretch the waveform in a similar way. It's kinda hard to explain, but I can sorta visualize how it's working.
@farmerchuck7294
3 жыл бұрын
I can explain it more simply: The X axis is frequency, the Y axis is time and the brightness of each pixel is velocity.
@stxnw
3 жыл бұрын
@@farmerchuck7294 wtf is velocity
@farmerchuck7294
3 жыл бұрын
@@stxnw It's basically how hard you play a note, it's in practically every DAW and it's kinda like volume but not exactly. I'm surprised someone can watch this guy without knowing what it is, but maybe you just started watching him.
@stxnw
3 жыл бұрын
@@farmerchuck7294 so its amplitude?
@farmerchuck7294
3 жыл бұрын
@@stxnw Pretty much
@X_TRMm
3 жыл бұрын
Yo you keep disappearing and appealing out of nowhere with great content 🔥🔥🔥
@Backfighter7O7
3 жыл бұрын
He is very appealing indeed!
@sootera7298
3 жыл бұрын
Task failed successfully
@progfox
3 жыл бұрын
he really makes grate con tent
@AidanChaz
3 жыл бұрын
Appearing
@kerbalis3298
2 жыл бұрын
yo i keep doin your mom
@kdizzle005
3 жыл бұрын
Of course here a challenge... Make a song out of pngs if that's even possible.
@sergejsdarznieks321
3 жыл бұрын
i already done it
@banananarwhal6591
3 жыл бұрын
@@sergejsdarznieks321 pics or it didn't happen
@onidaaitsubasa4177
3 жыл бұрын
It would also be cool to try to paint a full understandable picture with recognizable objects in the picture that make a song when converted to a wav file.
@them3ta_93
3 жыл бұрын
Can we just all appreciate the quality of your videos
@hadleykibblewhite4877
3 жыл бұрын
You should try converting audio to PNG to compressed jpg and back. Might be interesting.
@DafterHindi
3 жыл бұрын
There is a thing called databending where you open an image in a audio software and add effects it looks super trippy!
@dacolib
3 жыл бұрын
Im surprised you didnt try using random images or doodling on the image
@dacolib
3 жыл бұрын
or pure sounds, like sine/saw/square waves
@Kai_On_Paws_4298
2 жыл бұрын
@@dacolib I used a sine wave-
@carpet_appetite
3 жыл бұрын
0:01 omg the fucking nostalgia from the gta san andreas destination marker sound
@btarg1
3 жыл бұрын
1:50 why does that sound so damn good wow
@jaczob666
3 жыл бұрын
14:08 - That reminds me of scanning through radio stations sound.
@TCWTre
3 жыл бұрын
I can’t believe I’m watching this in the middle of class
@futureliink.
3 жыл бұрын
Your content is so different from other music producers. I love that!
@nixellion
3 жыл бұрын
I'm still watching, but the first thing I would do is convert the sound into a png and then BACK to audio without changes to make sure it even does that properly in the first place. Shakiness of audio might be just a png compression artefact or something like that
@Twat2024
3 жыл бұрын
Pretty true
@Kai_On_Paws_4298
2 жыл бұрын
I did it
@Kai_On_Paws_4298
2 жыл бұрын
It's not lol
@nixellion
2 жыл бұрын
@@Kai_On_Paws_4298 You mean it does not convert back to audio properly? :D Thought so
@VeralityCh
3 жыл бұрын
There's a function in Serum where you can use PNG images as wavetables
@alvarovalentin7001
3 жыл бұрын
In harnor in fl Studio you can do that too
@noface718
3 жыл бұрын
Its in the paid version of vital too I think
@raoufbensalem3417
3 жыл бұрын
@@noface718 you can try it in the free version i think
@noface718
3 жыл бұрын
@@raoufbensalem3417 nope Tested it You cant
@raoufbensalem3417
3 жыл бұрын
@@noface718 i think its text to speech not this
@waltwhitman7545
3 жыл бұрын
14:20 flipped all those layers and ended up sounding like a Blanck Mass song
@wyntrr_end
3 жыл бұрын
i think the weird stuttering you're experiencing, which you speculate at 3:55 is the sample rate, is due to the actual png resolution. i suspect that each one of those delay/echo effects is occurring with every pixel in the image, so if there were some way to increase the vertical resolution of the images that the converter program uses, you could have less choppy results.
@arcioko2142
3 жыл бұрын
what if the png resolution is the same as the sample rate
@wyntrr_end
3 жыл бұрын
@@arcioko2142 if the .png resolution was the sample rate, either the images would be much much taller or we wouldn't be able to see all the little oscillations in the resulting waveform, like at 6:59 we can clearly see the waveform's oscillations occur more quickly than the stutter effects, and based on how many of those oscillations fit across the screen at once, we can easily see how if there was even one pixel for each oscillation, the .png would be so much taller than it is
@arcioko2142
3 жыл бұрын
@@wyntrr_end oh ok
@ORyanMcEntire
3 жыл бұрын
It's because the audio is encoded into one single horizontal line of pixels that is then wrapped vertically. It should be read right to left, and then when you reach the end of the line on the right it continues on the next row on the far left. Think of it like reading this comment. If you did a vertical motion blur all you are doing is duplicating letters vertically across words in different lines. Example: This is a sentence about ducks. Quack! Gets incoded as: This is a sentence about ducks. Quack! Turns into: Tahbiosu t idsu cak sse.n tQeunaccek ! Tahbiosu t idsu cak sse.n tQeunaccek ! Which would turn back into audio as: Tahbiosu t idsu cak sse.n tQeunaccek ! Tahbiosu t idsu cak sse.n tQeunaccek ! Rather than: TTThhhiiiss iiisss aaa ssseeennnttteeennnccceee aaabbbooouuuttt ddduuuccckkksss... QQQuuuaaaccckkk!!! This is why everything got stuttery. Because he was blurring the sounds vertically across multiple rows of time. Even when blurring horizontally, the blur doesn't wrap with the pixels so the audio at the left and right edges gets messed up. If you could unwrap this image into a single horizontal row of pixels the blur would probably sound a bit more like reverb.
@wyntrr_end
3 жыл бұрын
@@ORyanMcEntire (with the exception of the motion blur on your ducks example) that actually makes a lot of sense. after experimenting with it a bit myself, I see that your explanation makes much more sense than what I said. interesting that this means there's no connection between sound frequencies and the x coordinate in the image (in the sense that the lower frequencies are not to the left of the higher frequencies or vice versa)
@kreblz
3 жыл бұрын
Omg I’ve ALWAYS wondered how this would work
@cupofdirtfordinner
3 жыл бұрын
Now do the reverse. In audacity, if you click "import raw audio" it will accept ANY file type as audio. Ive found using weird file types with weird data (.AVI, .blend, .apk, etc.) Gives the best results.
@hyperbeast4340
3 жыл бұрын
Good to know!
@EsportCat
3 жыл бұрын
I love how the song at end actually sounds pretty good lol, btw can you try making music in a video editor like premiere?
@natesalaa6810
3 жыл бұрын
do this but try changing the orange color completely to blue or green or something. that could be really interesting
@Kai_On_Paws_4298
2 жыл бұрын
It does nothing probably
@A_jbllover200
3 ай бұрын
1:36 sounds so lofi 3:39 *TRIPLE DRUMS* 7:01 poppy guitar and drums 10:59 DISTORTED BOI
@Spherey
3 жыл бұрын
how this website actually works is related to the way the golden record from voyager 1 and 2 works. how it’s related is because both ways the record’s encoded image audio and the way the image is converted into sound uses the same encoding technique. i used to have a hyper-fixation over this, which is how i know how it worked. let’s say you input an image with a height of 432 pixels. the waveform that it outputs is actually divided into 432 parts, with each part corresponding to one pixel row of the image. how the converter encodes each part of the waveform (which corresponds to one pixel row of the image as i said earlier) is by using the crests (aka peaks or high parts) and troughs (aka valleys or low parts) of the waveform as different brightnesses. crests corresponding to lighter colors of each pixel row, and troughs corresponding to darker colors of each pixel row. so the converter scans through each row of the image left-to-right and outputs them as one part of the waveform. the converter scans through the image and outputs them as a waveform until it completely finishes generating. this is how the images are converted into waveforms.
@3v068
3 жыл бұрын
You just gave me the perfect tool to make weird sounds for video games, and dubstep. I can not thank you enough for this video.
@zeno3062
3 жыл бұрын
i wonder what a picture of you would sound like XD
@Xatewn
3 жыл бұрын
that random sample is used in a song with 122M hahaha Rels B, Dellafuente - BUENOS GENES
@Solstici_
3 жыл бұрын
me he quedado igual al escucharlo JAJAJAJ
@dexterian477
3 жыл бұрын
I would actually love to see a part 2 to this video! That was awesome! ^_^
@ORyanMcEntire
3 жыл бұрын
I think the way you are assuming the audio got encoded as the image might be the reason the experiments didn't sound great. I'm pretty sure the audio is encoded into one single horizontal line of pixels that is then wrapped vertically. It should be read right to left, and then when you reach the end of the line on the right it continues on the next row on the far left. Think of it like reading this comment. If you did a vertical motion blur all you are doing is duplicating letters vertically across words in different lines. Example: This is a sentence about ducks. Quack! Gets incoded as: This is a sentence about ducks. Quack! Turns into: Tahbiosu t idsu cak sse.n tQeunaccek ! Tahbiosu t idsu cak sse.n tQeunaccek ! Which would turn back into audio as: Tahbiosu t idsu cak sse.n tQeunaccek ! Tahbiosu t idsu cak sse.n tQeunaccek ! Rather than: TTThhhiiiss iiisss aaa ssseeennnttteeennnccceee aaabbbooouuuttt ddduuuccckkksss... QQQuuuaaaccckkk!!! This is why everything got stuttery. Because you are blurring the sounds vertically across multiple rows selection of time stacked vertically. Even when blurring horizontally, the blur doesn't wrap with the pixels so the audio at the left and right edges gets messed up. If you could unwrap this image into a single horizontal row of pixels I bet the blur might sound a bit more like reverb.
@orfious
3 жыл бұрын
Even though the sounds were pretty garbage.. I know I will spend at least 2 hours making my very own garbage sounds
@Haydenex
3 жыл бұрын
I wonder if it reverses if you flip the image Edit: no it doesn't, it just makes its sound glitchy
@banananarwhal6591
3 жыл бұрын
10:06 "That's a bulgy boi" Beat proceeds to shart on everything.
@hyperbeast4340
3 жыл бұрын
Hmmmmmm
@xd-qi6ry
3 жыл бұрын
These faster uploads are amazing.
@Sol4rOnYt
3 жыл бұрын
12:26 laser gun yes
@PrincePyronius
3 жыл бұрын
What if you just changed the color?
@lancebeltran1811
3 жыл бұрын
Alternative title: *How the disk 11 is created*
@leonannaves9273
3 жыл бұрын
Just now i noticed how you look like T3ddy
@WildWolf-pu4pj
3 жыл бұрын
the noise added to the song when you first tried it, it sounded cool like it had a lofi-ish vibe
@FsKir
3 жыл бұрын
Literally sound desiigner
@csvscs
3 жыл бұрын
This is a really cool concept. I wonder what adding like visual distortion does to a guitar like does it actually distort it?
@Clumsy_the_24
3 жыл бұрын
12:22: (Gunshot noise) Levi: sounds kinda normal… You must live in America or something because it shouldn’t be a normal thing to hear anywhere else.
@geotube1379
3 жыл бұрын
He’s from New Zealand
@Kai_On_Paws_4298
2 жыл бұрын
To me it sounds epic
@Flumby_the_creator_YT
Жыл бұрын
AAIIIIIIIIIIIIIIIIIIIII
@maverickREAL
3 жыл бұрын
This could be crazy for making glitchy/weirdcore/hyperpop/experimental tracks
@KaitlinGaspar
3 жыл бұрын
WAIT THIS IS EVERYBTING IVE BEEN LOOKING FOR
@Marcmolemanold
3 жыл бұрын
fun fact, the TRH_guitar_loop_smooth_68_Em sample you dragged is the guitar that sounds on Rels B, Dellafuente - BUENOS GENES song xDDDDD
@berdnikoff
Жыл бұрын
*But was it possible? And I didn't even think that you can make a mashup not only in FL Studio 20, but also in wav2png :)*
Пікірлер: 392