如何使用Tensorflow 2.x从Google Audioset数据集中提取音频嵌入？

问题描述

我正在尝试从 Google Audioset dataset的TFRecord文件中提取音频嵌入，但是无法获取。

到目前为止，我只能使用SequenceExample格式将tfrecord文件解析为字符串，但我只需要audio_embeddings的功能序列

我正在使用 TensorFlow 2.3.0 以及我发现的有关Tensorflow 1.x的所有相关问题；在尝试compat.v1函数的错误消息中，我似乎得到了急切的执行错误和tf.data API使用建议。

这是我正在处理的代码：

tfr_file = tf.data.TFRecordDataset('10.tfrecord')
context_features = {"video_id": tf.io.FixedLenFeature([],tf.string),"labels": tf.io.VarLenFeature(tf.int64)}
sequence_features = {"audio_embeddings": tf.io.FixedLenSequenceFeature([10],dtype=tf.string)}

for raw_feature in tfr_file:
    example = tf.train.SequenceExample()
    example.ParseFromString(raw_feature.numpy())
    contexts,features = tf.io.parse_single_sequence_example(example,context_features,sequence_features)
    print(contexts)
    print(features
    print('--------')

错误：

Attempt to convert a value (context 
..................
.........
.........
with an unsupported type (<class 'tensorflow.core.example.example_pb2.SequenceExample'>) to a Tensor.

如果您需要其他任何信息，请告诉我。

解决方法

暂无找到可以解决该程序问题的有效方法，小编努力寻找整理中！

如果你已经找到好的解决方法，欢迎将解决方案带上本链接一起发送给小编。

小编邮箱:dio#foxmail.com (将#修改为@）

dataset tensorflow tfrecord