Hi! I think the problem is the format of the metadata file rather than the dataset size. You can test this with the following code:
import pyarrow.json as paj
table = paj.read_json("path/to/metadata_file")
You can find the correct metadata format here.
