How do I create a dataset from a large number of WAV files using the tensorflow.data.Dataset API?

    import tensorflow as tf
    import soundfile

    # Get the files into a list
    filepaths = tf.gfile.Glob('michael/dataset/wav_filepaths/*.wav')
    # get_labels is a pseudo-function that returns the label for each audio file
    labels = get_labels(filepaths)
    # List to hold raw audio lists. These are 2-channel WAVs, so this will be a 3D list
    raw_audio = []

    # Create a list where each element is the raw audio data of one file
    for f in filepaths:
        try:
            data, sample_rate = soundfile.read(f)  # 2 channels
            raw_audio.append(data.tolist())
        except Exception as err:
            # Poor practice to catch all exceptions like this, but it is just an example
            print('Exception')
            print(f)

    training_set = tf.data.Dataset.from_tensor_slices((raw_audio, labels))

2 Answers

User

Answer 1 · edited 2024-06-17 19:42:47

You could try using a generator function to feed the data into the pipeline. See https://www.tensorflow.org/api_docs/python/tf/data/Dataset#from_generator
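As a rough illustration of the generator approach, the sketch below reads one file at a time instead of holding every clip in a Python list. It makes several assumptions that are not in the question: TensorFlow 2.4 or later (for `output_signature`), 16-bit PCM files read with the standard-library `wave` module rather than `soundfile`, integer labels, and the hypothetical helper names `read_wav`, `wav_generator`, and `make_dataset`.

```python
import wave

import numpy as np
import tensorflow as tf


def read_wav(path):
    """Read a 16-bit PCM WAV file into a float32 array of shape (frames, channels)."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError(f"{path}: only 16-bit PCM is handled in this sketch")
        channels = wf.getnchannels()
        raw = wf.readframes(wf.getnframes())
    samples = np.frombuffer(raw, dtype="<i2").reshape(-1, channels)
    return samples.astype(np.float32) / 32768.0  # scale to [-1.0, 1.0)


def wav_generator(filepaths, labels):
    """Yield (audio, label) pairs lazily, skipping files that cannot be read."""
    for path, label in zip(filepaths, labels):
        try:
            audio = read_wav(path)
        except (wave.Error, ValueError, EOFError, OSError) as err:
            print(f"Skipping {path}: {err}")
            continue
        yield audio, label


def make_dataset(filepaths, labels, channels=2):
    """Wrap the generator in a tf.data.Dataset; clip length may differ per file."""
    return tf.data.Dataset.from_generator(
        lambda: wav_generator(filepaths, labels),
        output_signature=(
            tf.TensorSpec(shape=(None, channels), dtype=tf.float32),
            tf.TensorSpec(shape=(), dtype=tf.int32),
        ),
    )
```

Because a label is only yielded together with a file that was read successfully, audio and labels stay aligned even when some files are skipped. Each element is produced on demand, so memory use no longer grows with the number of files.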

User

Answer 2 · edited 2024-06-17 19:42:47

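While it is possible in theory to read the files and decode them with TensorFlow ops, the usual approach in this situation is to convert the data to TFRecord format and then read it back through a TFRecord reader. This blog post shows an example of how to do that; in your case you would need a script that reads each WAV file, decodes it, and writes the vector of samples to the file (I think 32-bit values are the simplest way). Note that if you want to batch several audio files into a single tensor, either they must all be the same size, or you must pad them to form a properly shaped tensor.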
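The conversion and read-back steps might look roughly like the sketch below. It is one possible arrangement under stated assumptions, not the method from the blog post: samples are stored as raw little-endian float32 bytes, labels are integers, every clip has the same number of channels, and `tf.data.TFRecordDataset` with `padded_batch` is used for reading and padding. The function names and the feature keys `"audio"` and `"label"` are hypothetical.

```python
import numpy as np
import tensorflow as tf


def serialize_example(audio, label):
    """Pack one (frames, channels) clip and its integer label into a serialized tf.train.Example."""
    audio = np.asarray(audio, dtype="<f4")  # 32-bit little-endian floats
    feature = {
        "audio": tf.train.Feature(bytes_list=tf.train.BytesList(value=[audio.tobytes()])),
        "label": tf.train.Feature(int64_list=tf.train.Int64List(value=[int(label)])),
    }
    return tf.train.Example(features=tf.train.Features(feature=feature)).SerializeToString()


def write_tfrecord(path, examples):
    """Write an iterable of (audio, label) pairs to a single TFRecord file."""
    with tf.io.TFRecordWriter(path) as writer:
        for audio, label in examples:
            writer.write(serialize_example(audio, label))


def parse_example(record, channels):
    """Decode one serialized example back into (audio, label)."""
    parsed = tf.io.parse_single_example(
        record,
        {
            "audio": tf.io.FixedLenFeature([], tf.string),
            "label": tf.io.FixedLenFeature([], tf.int64),
        },
    )
    flat = tf.io.decode_raw(parsed["audio"], tf.float32)
    return tf.reshape(flat, [-1, channels]), parsed["label"]


def load_dataset(path, batch_size, channels=2):
    """Read the TFRecord file and zero-pad clips to the longest clip in each batch."""
    dataset = tf.data.TFRecordDataset(path).map(lambda r: parse_example(r, channels))
    return dataset.padded_batch(batch_size, padded_shapes=([None, channels], []))
```

The pairs passed to `write_tfrecord` can come from any iterable, so the conversion script never needs to hold all clips in memory at once. Padding is applied per batch, which means shorter clips are extended with zeros only up to the longest clip in the same batch.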
