reading only selected files from HDFS using arrow dataset cpp api

  apache-arrow, c++, hdfs

I have a special use case where I want to read only selected parquet files(list of files) from a directory on hdfs filesystem.
I know I can use whole directory as dataset and then apply filter on data, but I want to read only selected list of files.
How can I do that using arrow cpp api ?
Also if you can share an example of how to use hdfs filesystem to read parquet files using arrow cpp api.(where I can pass a list of files that I want to read).
Thanks in advance.

Source: Windows Questions C++

LEAVE A COMMENT