"Spectral noise elimination eliminates noise" for audio playback

"Spectral noise elimination eliminates noise"
https://www.ai-shift.co.jp/techblog/1305

After writing the audio data after separating the noise portion and the sound source from the site. I want to play the sound.On the program, Sound Envelop is the noise part,
The recovered_signal should be the voice after the sound source has been isolated.I think I can do it if I make these two into wav files. Please tell me how to do that.

The source code below contains some programs from the website.

!pip install librosa

import numpy as np
from scope.ndimage import maximum_filter1d

default(y, rate, threshold):
    """
    Args:
        - y —Signal data
        - rate —Sampling frequency
        - threshold —Threshold for noise determination
    Returns:
        - mask —Whether the amplitude is greater than or equal to the threshold.
        - y_mean —Sound Envelop
    """
    y_mean=maximum_filter1d(np.abs(y), mode="constant", size=rate//20)
    mask = [ mean > threshold for mean in y_mean ]
    return mask, y_mean

import librosa


n_ft = 2048 # Number of voice frames between STFT columns
hop_length = 512 # Number of voice frames between STFT columns
win_length=2048# Window Size
n_std_thresh=1.5# Threshold for how many standard deviations (dB of the mean at each frequency level) are greater than the mean of the noise to be considered a signal

def_stft(y,n_ft,hop_length,win_length):
    return librosa.stft(y=y, n_ft=n_ft, hop_length=hop_length, win_length=win_length)

def_amp_to_db(x):
    return librosa.core.amplitude_to_db(x,ref=1.0,amin=1e-20,top_db=80.0)



sample_rate = 32000

# Loading Audio Files
# noise_clip = open(r "C:\Users\1818067\birdvoice.wav")

path=r'C:\Users\1818067\birdvoice.wav'
sig,_=librosa.load(path,sr=sample_rate)

# noise data acquisition
mask, noise_clip = envelope(sig, sample_rate, threshold = 0.03)




noise_stft =_stft(noise_clip, n_ft, hop_length, win_length)
noise_stft_db =_amp_to_db(np.abs(noise_stft))# Convert to dB

mean_freq_noise=np.mean(noise_stft_db,axis=1)
std_freq_noise=np.std(noise_stft_db,axis=1)
noise_thresh = mean_freq_noise+std_freq_noise*n_std_thresh

import librosa
from envelope import envelope
import scipy


n_grad_freq = 2# Number of frequency channels to be smoothed by the mask
n_grad_time=4#Number of time channels to smooth using the mask
prop_decrease=1.0# How much noise do you reduce?


# data=open(r'C:\Users\1818067\birdvoice.wav')
#audio_clip=envelope(data).envelop

sample_rate = 32000

# Loading Audio Files
# noise_clip = open(r "C:\Users\1818067\birdvoice.wav")

path=r'C:\Users\1818067\birdvoice.wav'
sig,_=librosa.load(path,sr=sample_rate)

# noise data acquisition
audio_clip, rate=librosa.load('birdvoice.wav')


# A sound source is also extracted by STFT.
sig_stft =_stft(audio_clip, n_ft, hop_length, win_length)
sig_stft_db =_amp_to_db(np.abs(sig_stft))

# Create mask smoothing filters over time and frequency
smoothing_filter=np.outer(
        np.concatenate(
            [
                np.linspace(0,1,n_grad_freq+1,endpoint=False),
                np.linspace(1,0, n_grad_freq+2),
            ]
        )[1:-1],
        np.concatenate(
            [
                np.linspace(0,1,n_grad_time+1,endpoint=False),
                np.linspace(1,0, n_grad_time+2),
            ]
        )[1:-1],
    )
smoothing_filter=smoothing_filter/np.sum(smoothing_filter)

# Calculating Time and Frequency Thresholds
db_thresh=np.repeat(
        np.reshape (noise_thresh, [1, len(mean_freq_noise)],
        np.shape(sig_stft_db)[1],
        axis = 0,
    ) .T
sig_mask = sig_stft_db<db_thresh
sig_mask=scipy.signal.ftconvolve(sig_mask, smoothing_filter, mode="same")
sig_mask = sig_mask *prop_decrease

mask_gain_dB = np.min(_amp_to_db(np.abs(sig_stft)))

def_db_to_amp(x,):
    return librosa.core.db_to_amplitude(x,ref=1.0)

sig_stft_db_masked=(
        sig_stft_db*(1-sig_mask)
        + np.ones(np.shape(mask_gain_dB))*mask_gain_dB*sig_mask
)

def_istft(y,hop_length,win_length):
    return librosa.istft(y,hop_length,win_length)

sig_imag_masked=np.imag(sig_stft)*(1-sig_mask)
sig_stft_amp=(_db_to_amp(sig_stft_db_masked)*np.sign(sig_stft))+(1j*sig_imag_masked)

recovered_signal=_istft(sig_stft_amp, hop_length, win_length)

python python3

2022-09-30 11:05

2 Answers

The latest version of librosa does not have the file output function itself, as shown below, and says you should use a separate package called soundfile.

librosa 0.7.2
librosa.output.write_wav(path,y,sr,normal=False)

This function is deprecated in librosa 0.7.0. It will be removed in 0.8. Usage of write_wav should be replaced by soundfile.write.

librosa 0.8.1
Write out audio files

PySoundFileprovides output functionality that can be used directly with numpy array audio buffers:

SoundFile
soundfile.write(file,data,samplerate,subtype=None,endian=None,format=None,closefd=True)

Examples
Write 10 frames of random data to a new file:

>>import numpy as np
>>import soundfile as sf
>> sf.write('stereo_file.wav', np.random.randn(10,2), 44100, 'PCM_24')

Additional References:
WAV export on 16-bit in librosa

So, for example, to file recovered_signal, you can do the following:

import soundfile as sf

sf.write('clearvoice.wav', recovered_signal, 22050, subtype='PCM_16')

And to play the wav file, you can use one of these articles.
Handle sound in Python

For example, if playsound is first mentioned in the above article, the latest version will fail as mentioned in the previous question, so
I get an error when I try to play audio using playsound

After installing the previous version,

 pip install-Uplaysound==1.2.2

You can call them as follows.

from playsound import playsound

playsound('clearvoice.wav')

2022-09-30 11:05

When I run it, it becomes cannot import name 'envelope' from 'envelop' but
Could you show me the revised code?

2022-09-30 11:05

If you have any answers or tips

Popular Tags

python x 4647

android x 1593

java x 1494

javascript x 1427

c x 927

c++ x 878

ruby-on-rails x 696

php x 692

python3 x 685

html x 656