  1. python - How to read a Parquet file into Pandas DataFrame

    How to read a modestly sized Parquet data set into an in-memory Pandas DataFrame without setting up a cluster computing infrastructure such as Hadoop or Spark? This is only a …

  2. python - How to read a list of parquet files from S3 as a pandas ...

    Indeed, when partitioned parquet files are stored to S3, they are usually first written to a "_temporary" directory. If this directory is not empty, it is a clear sign that the S3 location …

  3. Is it possible to read parquet files in chunks? - Stack Overflow

    Nov 29, 2019 · For example, pandas's read_csv has a chunksize argument which makes read_csv return an iterator over the CSV file so we can read it in chunks. The Parquet format …

  4. python - Pandas : Reading first n rows from parquet file ... - Stack ...

    Dec 31, 2018 · I have a parquet file and I want to read the first n rows from the file into a pandas DataFrame. What I tried: df = pd.read_parquet(path='filepath', nrows=10) It did not work and gave …

  5. Read multiple parquet files in a folder and write to single csv file ...

    Aug 5, 2018 · I am new to Python and I have a scenario where there are multiple parquet files with file names in order, e.g. par_file1, par_file2, par_file3 and so on, up to 100 files in a folder. I …

  6. How to open huge parquet file using Pandas without enough RAM

    Feb 11, 2020 · I am trying to read a decently large Parquet file (~2 GB with about 30 million rows) into my Jupyter Notebook (in Python 3) using the Pandas read_parquet function. I have …

  7. How to read parquet files from Azure Blobs into Pandas DataFrame?

    Aug 11, 2020 · I need to read .parquet files into a Pandas DataFrame in Python on my local machine without downloading the files. The parquet files are stored on Azure blobs with …

  8. Get schema of parquet file in Python - Stack Overflow

    Oct 9, 2020 · Is there any Python library that can be used to just get the schema of a parquet file? Currently we are loading the parquet file into a DataFrame in Spark and getting the schema from the …

  9. pandas - Read a parquet bytes object in Python - Stack Overflow

    Sep 23, 2019 · I have a Python object which I know is a parquet file loaded into the object. (I do not have the possibility to actually read it from a file.) The object var_1 contains …

  10. How to read partitioned parquet files from S3 using pyarrow in …

    Jul 13, 2017 · Is there a way we can easily read the parquet files in Python from such partitioned directories in S3? I feel that listing all the directories and then reading them is not …