ä»ã®æè¡ã ãšãå®å šã«AIã ãã§ããŒã«ã«ïŒã¢ã«ãã©ïŒãæœåºããã®ã¯ãŸã ã¡ãã£ãšçŸå®çãããªããå€ãã®å Žåãæåã§ã®ä¿®æ£ãå¿ èŠã«ãªããã§ããããŒã¿éããååã«å€§ãããã°ãAIã§ããŒã«ã«ãæœåºããåŸã«åŠç¿ã«é©ãããªãŒãã£ãªããªããšãéããããšãã§ãããã
ãã®èšäºã§ã¯ãçŽ äººã§ãããã¯ãå®å šã«AIã䜿ã£ãŠåŠç¿çšãªãŒãã£ãªãéžå¥ããã¯ãŒã¯ãããŒã玹ä»ãããããã¹ãŠãªãŒãã³ãœãŒã¹ã®ãœãããŠã§ã¢ã䜿ã£ãŠããã
åŠç¿çšãšæšè«çšã®ãªãŒãã£ãª
AIã«ããŒçšã®ãªãŒãã£ãªã«ã€ããŠãæè¿ã®èªåã®ç ç©¶ããŸãšãããšã倧äœãåŠç¿ã¯é³è³ªã«å³ãããæšè«ã¯ãããïŒé³é«ïŒã«å³ãããã£ãŠèšããã
ã€ãŸããåŠç¿çšã®ãªãŒãã£ãªãšæšè«çšã®ãªãŒãã£ãªã¯åããŠèããã¹ããªãã ã
æšè«çšã®ãªãŒãã£ãªã§éèŠãªã®ã¯ããããã€ãŸããã³ãŒã©ã¹ïŒãã¢ãªïŒã®åŠçãããŸãã§ããŠããã°OKã§ãã±ãã±ããã€ã¹ïŒãªãŒããã¥ãŒã³ïŒããããã©ããã¯äºã®æ¬¡ã
ã§ããåŠç¿çšã®ãªãŒãã£ãªãšãªããšãã»ãŒå®ç§ãªã¯ãªãªãã£ã®ãã®ã䜿ãå¿ èŠããããã ã
é³è³ª
ãŸãã¯ã§ããã ãé«é³è³ªãªãªãŒãã£ãªãéžã³ããããã¹ããªã®ã¯ãã¹ã¬ã¹ã®é³æºã ãã©ãåŠçããããïŒèæ¯ãã€ãºãåŠçããããïŒãªãŒãã£ãªãéžã¶æ¹ããçµæçã«ããè¯ã广ãåŸããããããããªãã
å ·äœçãªé³è³ªã«ã€ããŠã¯ã alexkay/spek ãšãããœããã䜿ã£ãŠç¢ºèªã§ãããã
泚æç¹ïŒäžå¯éå§çž®ïŒææïŒã®é³æºããã¹ã¬ã¹ïŒç¡æïŒåœ¢åŒã«å€æããŠãé³è³ªã¯äžãããªããå®éã«ãœããã§èŠãã°äžç®çç¶ã ãã
ãããïŒé³é«ïŒ
åŠç¿çšã®ãªãŒãã£ãªã¯ãã§ããã ãå¹ åºããããç¯å²ãã«ããŒããŠããã®ãçæ³çãäžè¬çã«ãæ®éã®è©±ã声ã ãšãããªæãïŒ
- ç·æ§: 85Hz - 180Hz
- 女æ§: 165Hz - 255Hz
æãå Žåãªããçè«äžã¯ E2 (82Hz) ãã C5 (523Hz) ãŸã§ãã«ããŒããã®ãããããããã¡ããããã¡ã«ã»ããïŒè£å£°ïŒãå¿ èŠãªããäžéã800HzãŸã§åŒãäžããŠããããã
ããå¯èœãªãããããŒããããŒããšãã£ããã³ã°ããŒã³ã®é³å£°ã䜿ããšãã¢ãã«ã«æãå®å šãªF0ïŒåºæ¬åšæ³¢æ°ïŒã®é£ç¶çãªç¹åŸŽãæäŸã§ããããšãå€ãã
æè¿ã®äž»æµãªAIã«ããŒã¢ãã«ã¯ãããæœåºã«RMVPEã¢ã«ãŽãªãºã ã䜿ã£ãŠããŠãçè«çã«ã¯ãã£ã¡ã®æ¹ã广çãã§ããpraat ã䜿ã£ãŠå€§äœã®ãããã®ç®å®ããµã¯ããšç¢ºèªããããšãã§ããããŸãã¯äŸåã©ã€ãã©ãªãã€ã³ã¹ããŒã«ãããã
| |
ãããããpitch.py ãšãããã¡ã€ã«ãäœæããŠã以äžã®å
容ãå
¥åããŠãã
| |
ãã®ããã°ã©ã ã¯ãçŸåšã®ãã£ã¬ã¯ããªã«ãã audio ãã©ã«ãå
ã®ãã¹ãŠã®ãªãŒãã£ãªãã¡ã€ã«ã®ããããèªã¿åã£ãŠãã¿ãŒããã«ã«ãµããªãŒãåºåããããã«çŸåšã®ãã£ã¬ã¯ããªã«ãããã®ã°ã©ããæç»ããŠããããã
ãã¹ãŠã®ãªãŒãã£ãªãã¡ã€ã«ã audio ãã©ã«ãã«å
¥ãããã以äžã®ã³ãã³ããå®è¡ããŠãããã確èªãããã
| |
ããŒã«ã«ïŒã¢ã«ãã©ïŒæœåº
䜿çšãã nomadkaraoke/python-audio-separator ã䜿ããšãããªãå€ãã®ã¢ãã«ãå®è¡ã§ãããã
GPUçã®ã€ã³ã¹ããŒã«ïŒ
| |
ã¢ãã«ã®äžèŠ§ç¢ºèªïŒ
| |
çŸåšãå¹æãæ¯èŒçé«ãã®ã¯å€§äœ MelBand Roformer ã¢ãã«ããã®ã¢ãã«ã¯ audio-separator ã§ã¯ MDXC ã¢ãŒããã¯ãã£ã«å±ããŠããŠããã®ã¢ãŒããã¯ãã£ã«ã¯ããã€ãå
±éããŠäœ¿ãããã©ã¡ãŒã¿ããããã ã
--mdxc_segment_size=512: ã»ã°ã¡ã³ããµã€ãºãå€ã倧ããã»ã©ã¢ãã«ã®ã³ã³ããã¹ãçè§£åãåäžããçè«çã«ã¯å¹æãè¯ããªãã--mdxc_override_model_segment_size: ã»ã°ã¡ã³ããµã€ãºã匷å¶çã«å€æŽããã¢ãã«ã®ããã©ã«ãå€ãäžæžãããã--mdxc_overlap=8: äºæž¬ãŠã£ã³ããŠéã®ãªãŒããŒã©ããåæ°ãç¯å²ã¯ 2ã50 ã§ãçè«äžã¯é«ãã»ã©æ»ãããªçµæã«ãªãã--mdxc_batch_size=4: åæã«åŠçããæ°ãVRAMïŒãããªã¡ã¢ãªïŒã®å®¹éã«åãããŠèª¿æŽããŠãã--mdxc_pitch_shift=0: ãããã·ããïŒããŒå€æŽïŒèª¿æŽãéåžžã¯ããã©ã«ãã® 0 ã®ãŸãŸã§OKã
ã¡ãªã¿ã«ããã¡ããã¡ãé·ããªãŒãã£ãªãåŠçãããšãã¯ãããã€ãã®ãã£ã³ã¯ã«åå²ããŠåŠçããæ¹ãé床ãéããªãããšãå€ããã
| |
ããããã¢ãã«ã¯ã©ããã£ãŠéžã¹ã°ãããã ããïŒããã§ã¯ AliceNavigator/Music-Source-Separation-Training-GUI ãåèã«ããŠãã¢ãã«ãã䌎å¥é€å»ïŒããŒã«ã«æœåºïŒããã³ãŒã©ã¹é€å»ãããªããŒãé€å»ããããŠããã®ä»ïŒãã€ãºé€å»ããã¬ã¹ã»ã¯ãªãã¯ãã€ãºé€å»ãªã©ïŒãã®4ã€ã®ã«ããŽãªã«åããŠã¿ããã
å ·äœçãªã¢ãã«éžã³ã¯ãã¢ãã«ã®SDRå€ïŒä¿¡å·å¯Ÿé鳿¯ïŒãåèã«ãããšãããçè«äžã¯é«ããã°é«ãã»ã©å¹æãè¯ããããã€ããããã of ã¢ãã«ã玹ä»ãããã
䌎å¥é€å»
ããŒã«ã«ãæœåºããå Žåãäžè¬çã«ã¯ãRoformer Model: MelBand Roformer Kim | FT 3 by unwaããåªç§ã ããå ·äœçãªäœ¿ãæ¹ã¯ä»¥äžã®éãïŒ
| |
åŠçãçµãããšããã¡ã€ã«åã« vocals ãšä»ãããã®ãæœåºãããããŒã«ã«ã«ãªãããããæ¬¡ã®ã¹ãããã®åŠçã«åããã
ã³ãŒã©ã¹ïŒãã¢ãªïŒé€å»
æ²ã«ãã£ãŠã¯2人以äžã®æå£°ãå ¥ã£ãŠããããšãããããããã®å Žåã¯ã³ãŒã©ã¹é€å»ã¢ãã«ã䜿ã£ãŠãã¡ã€ã³ããŒã«ã«ã®æå£°ã ããåãåºãå¿ èŠããããäžè¬çã«ã¯ãKaraokeãã¢ãã«ã·ãªãŒãºã䜿ããšè¯ããŠãäŸãã°ãRoformer Model: MelBand Roformer | Karaoke V2 by Gaboxãããããããäœ¿ãæ¹ã¯ãã¡ãïŒ
| |
åŠçãçµãã£ãããvocals ãšä»ããŠãããã®ãã¡ã€ã³ããŒã«ã«ã®é³å£°ã ããããããæ¬¡ã®ã¹ãããã«é²ããŠãã
ããšãããç·å¥³ã®ããã«ã¡ã€ã³ããŒã«ã«ãªãããRoformer Model: BS Roformer | Chorus Male-Female by Sucialããšããã¢ãã«ã詊ããŠã¿ãã®ããããã¢ãã«ãã¡ã€ã«å㯠model_chorus_bs_roformer_ep_267_sdr_24.1275.ckpt ã ãã
ãªããŒãïŒæ®é¿ïŒé€å»
ããã«ããŒæ²ãäœãã®ãç®çãªãããRoformer Model: MelBand Roformer | De-Reverb by anvuewãã䜿ããã
ã§ããã¢ãã«ã®åŠç¿ïŒãã¬ãŒãã³ã°ïŒçšãªããã¢ãã©ã«çã®ãRoformer Model: MelBand Roformer | De-Reverb Mono by anvuewãã䜿ãã®ãããããã
ãªãããšãããšãçŸåšã®AIã¢ãã«ã¯åŠç¿æã«äžåŸã¢ãã©ã«ãªãŒãã£ãªãæ¡çšããŠãããããªãã ãããã¹ãã¬ãªïŒ2ãã£ã³ãã«ïŒãå ¥åãããšãå·Šå³ã®äœçžå·®ã®ããã§åŠç¿æã«ãã€ãºãå ¥ã蟌ãã§ããŸãå¯èœæ§ããããããªãã ããã
| |
noreverb ãšä»ããŠãããã¡ã€ã«ãããªããŒããé€å»ãããé³å£°ã ãã
ãã®ä»ã®ã¢ãã«
äŸãã°ãã€ãºé€å»ã¢ãã«ãªãããRoformer Model: Mel-Roformer-Denoise-Aufr33ãã䜿ã£ãŠãã€ã¯ã®ãã€ãºãç°å¢ã®åºãã€ãºïŒãã¯ã€ããã€ãºãªã©ïŒãæ¶ãããšãã§ããã
| |
ä»ã«ã¯ããã¬ã¹ãæ°æ³¡é³ïŒã¢ã¹ãã¬ãŒã·ã§ã³ïŒãé€å»ããã¢ãã«ãRoformer Model: MelBand Roformer | Aspiration by Sucialããªããããããã¢ãã«ãã¡ã€ã«ã¯ aspiration_mel_band_roformer_sdr_18.9845.ckptã
ãã®ä»ã®ã¢ãŒããã¯ãã£
ãã€ãºé€å»ã«é¢ããŠèšããšã鳿¥œä»¥å€ã®é³å£°ïŒåã声ãªã©ïŒãªã DeepFilterNet3 ãšããã¢ãã«ã广çãããããªããæè»œã«äœ¿ããã®ã
Shuichi346/DeepFilterNet3-VST3
ãããã¯DAWçšã®ãã©ã°ã€ã³ãªã®ã§ã䜿ãã«ã¯DAWãã€ã³ã¹ããŒã«ããå¿
èŠããããããšãäœè
ã¯MacOSçããé
åžããŠããªãã®ã§ãä»ã®OSïŒWindowsãªã©ïŒã䜿ãå Žåã¯èªåã§ãã«ãããå¿
èŠããããã
DAWã«ã€ããŠã ãã©ã REAPER ãããããããã®ãœããã¯å ¬åŒã§ç¡å¶éã®è©äŸ¡å©çšãã§ããã®ã§ãå®è³ªç¡æã§äœ¿ãããã ã
VST3ãã©ã°ã€ã³ããã«ãããã«ã¯ããŸãRustãã€ã³ã¹ããŒã«ããå¿
èŠãããã
Rustup-init
ãããŠã³ããŒãããŠå®è¡ããéžæè¢ãåºãã 1 ãéžãã§ããã©ã«ãã®ãŸãŸé²ããã°OKãéäžã§Visual Studioã®ããŠã³ããŒããæ±ãããããã©ããªãã·ã§ã³ã¯å€æŽããã«ãã®ãŸãŸå
šéšããŠã³ããŒããã¡ãã£ãŠå€§äžå€«ã
ããŠã³ããŒããçµãã£ãããããŒã«ã«ã«ãããžã§ã¯ããã¯ããŒã³ããŠãã£ã¬ã¯ããªã«ç§»åãããã
| |
ãã«ãã®éå§ïŒ
| |
ãã«ããå®äºããããtarget ãã©ã«ãå
ãäœéå±€ãæœã£ãŠ deepfilter-vst.vst3 ãã¡ã€ã«ãèŠã€ããããã C:\Program Files\Common Files\VST3 ãã©ã«ãã«ã³ããŒããŠãã
REAPERãéããŠããªãŒãã£ãªãã¡ã€ã«ããã©ãã°ïŒããããããå·ŠåŽã®ãFXããã¿ã³ãã¯ãªãã¯ããŠæ€çŽ¢ã»è¿œå ããã°äœ¿ããããã«ãªãã
å人çã«ã¯ãå®éã«äœ¿ã£ãŠã¿ãæãã ãšãããŸã§å¹æãè¯ããšã¯æããªãã£ãïŒå¿ èŠãªé³ãŸã§æ¶ãã¡ããããšãå€ãïŒããŸãã詊ããŠã¿ã䟡å€ã¯ãããããã
é³å£°ã®ããŒãã©ã€ãºïŒèŠæ ŒåïŒ
ã¢ãã«åŠç¿ã«äœ¿ãé³å£°ã®æå€§ããŒã¯é³éã¯ã-3dB ãã -6dB ã®éã«åããã®ããã¹ããé«ããããšé³å²ãã®åå ã«ãªã£ã¡ããã
éã«ã-40dB以äžã®éšåã¯æ¬¡ã®ã¹ãããã®ãã¹ã©ã€ã¹ïŒåãåºãïŒãã§ã«ãããããŠããŸããã ããããã®æ®µéã§ãªãŒãã£ãªã®æé«é³éã -3dB ã«ããŒãã©ã€ãºããŠããã®ãããããã
äŸåããã±ãŒãžã®ã€ã³ã¹ããŒã«ïŒ
| |
以äžã®ã³ãã³ãã䜿ããšãçŸåšã®ãã£ã¬ã¯ããªã«ãããã¹ãŠã® .wav ãã¡ã€ã«ãåŠçãããããŒãã©ã€ãºããããã¡ã€ã«ã normalized ãã©ã«ãã«ä¿åãããããäºåã« normalized ãã©ã«ããäœæããŠããã®ãå¿ããªãã§ãã
| |
ã³ãã³ãã®ç°¡åãªè§£èª¬ã¯ãããªæãïŒ
-nt peak:nt㯠Normalization TypeïŒããŒãã©ã€ãºã®çš®é¡ïŒã®ããšã§ãããã§ã¯ããŒã¯å€ãæå®ããŠããã-t -3: ã¿ãŒã²ããå€ã -3dB ã«èšå®ã-ext wav: åºåãã©ãŒããããwavã«æå®ã-o: åºåå ãã©ã«ãã
ããã§ãåŠç¿ã«äœ¿ããã¹ãŠã®ãªãŒãã£ãªã®æå€§é³éã -3dB ã«åäžåããããã
ã¹ã©ã€ã¹ïŒåå²ïŒ
flutydeer/audio-slicer ã䜿ããšããªãŒãã£ãªãèªåçã«åå²ã§ããããã®äžããèŽããŠã¿ãŠèªç¶ãªãªãŒãã£ãªãéžãã§ãããã
ã¢ãã«åŠç¿çšã«ãèŽãå¿å°ã®è¯ãã¯ãªãŒã³ãªãã€ã¯ãéžã³åºãããåã¯ãªããã¯çããŠã2ç§ä»¥äžãã§ããã°4ç§ä»¥äžããã®ããã¹ãã
åèšæéã¯å€§äœ10åã30åãããã°ååãæå€§ã§ã2æéãè¶
ããªãããã«ããããéžå¥ãçµãã£ãããPowerShellç°å¢ã§ä»¥äžã®ã³ãã³ããå®è¡ããŠãçŸåšã®ãã¹ãŠã® .wav ãã¡ã€ã«ã®åèšæéã確èªã§ãããã
| |
éžå¥ïŒã¯ãªãŒãã³ã°ïŒ
åå²ããããªãŒãã£ãªããããããããã€ã¹ã®ãããªæ©æ¢°é³ãæé€ããèªç¶ãªå£°ã®ãã€ã¯ã ããæ®ãã foobar2000 ãšãããœããã䜿ããšãäœèšãªãªããŒããªã©ãäžåéããã«åçã§ããããããã¯ãã¡ãèŽããŠããé³ããã®ãŸãŸãã¢ãã«ãå®éã«èŽãé³ãã«ãªããã ã
ããŠã³ããŒãããŠã€ã³ã¹ããŒã«ããããã¬ã€ã¢ãŠãïŒMain LayoutïŒãèšå®ããããã§ããã ããã¬ã€ãªã¹ãã倧ãã衚瀺ããããã®ïŒäŸãã° Slim View + Tabs ãªã©ïŒãéžã¶ã®ãããããã
- åºåããã€ã¹ã®éžæ
Ctrl+P ãæŒããŠèšå®ç»é¢ãéããPlayback -> Output ã® Device ã§ exclusiveïŒæä»ã¢ãŒãïŒãšæžãããŠããããã€ã¹ãéžæããã
- çŽ æ©ãåé€ããããã®ã·ã§ãŒãã«ããããŒãèšå®
èšå®ç»é¢ã® Keyboard Shortcuts ã§æ°ããã·ã§ãŒãã«ããã远å ãããAction ã®ãšããã§ delete ãæ€çŽ¢ãã[context]->File Operations->Delete file(s) ãéžæããããã㊠Key ã®ãšããã§ããŒïŒäŸãã° Ctrl+DïŒãç»é²ããã
èšå®ãçµãã£ããããã¹ãŠã®ãªãŒãã£ãªããã¬ã€ãªã¹ãã«ãã©ãã°ïŒããããããŠãåçããªããéžå¥ãéå§ãããã
ããšããïŒææ³ïŒ
ä»ã®AIã®å®åãèŠãéãããŸã åœåã¯ãä»äºã奪ãããããªããŠå¿é ã¯ããªããŠãããããå°ãªããšããªãŒãã£ãªã®åéã«ãããŠã¯ãAIã¯ãŸã ã䟿å©ãªããŒã«ãã®æ®µéã«çãŸã£ãŠãããäœæ¥å¹çãäžããã®ã«ã¯åœ¹ç«ã€ãã©ããã¯ã¿ãããªå®å šãªçŽ äººãããŒã«ã䜿ã£ãã ãã§ãããªãå®ç§ãªäœåãäœãããããããªããããã
æè¡ã¯ã©ãã©ãé²åããŠããã ãããã©ããã£ã±ãåŸæ¥ã®ããŒã«ã䜿ã£ãçµéšããã人ã®äŸ¡å€ã¯ãããç°¡åã«ã¯ä»ã§ä»£çšã§ããªããã®ã ãšæããªã