Thank you very much for your contribution and for sharing it. I have always been curious about the evaluation metrics for co-speech, and I would like to ask whether the test datasets used for the metrics in your paper are the same as the ones used for Camn. I noticed that the test datasets in your code are somewhat different from Camn in terms of LMDB loading. If you could spare some time to answer this, I would be very grateful.