
This function is one of the Plugin Operations. You can find a video of it in the ARGOS RPA+ video tutorials.



pandas III

Author: Jerry Chae


Now capable of processing multiple input data files (dataframes).

This is the third plugin in our pandas series. Like pandas-II, it lets users execute Python statements sequentially, as in Jupyter Notebook, but automatically and without Jupyter Notebook. The major difference is that pandas-III can take multiple data files as input.
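
As a rough illustration of running statements one after another against several dataframes, the sketch below loads two small in-memory CSV files and executes a list of statements in a shared namespace, much like notebook cells. The list name `dfs`, the sample data, and the use of `exec` are assumptions made for this example; how the plugin actually exposes the dataframes to your statements may differ.

```python
import io
import pandas as pd

# Two small in-memory CSV "files" standing in for real input data files.
sales_csv = io.StringIO("id,amount\n1,100\n2,250\n")
names_csv = io.StringIO("id,name\n1,Alice\n2,Bob\n")

# One dataframe per input file, kept in input order.
# The name `dfs` is hypothetical; the plugin may name dataframes differently.
dfs = [pd.read_csv(sales_csv), pd.read_csv(names_csv)]

# Statements run in sequence and share variables, like notebook cells.
statements = [
    "merged = dfs[0].merge(dfs[1], on='id')",
    "merged['amount_with_tax'] = merged['amount'] * 1.1",
    "result = merged[['name', 'amount_with_tax']]",
]
namespace = {"pd": pd, "dfs": dfs}
for statement in statements:
    exec(statement, namespace)

result = namespace["result"]
print(result)
```

Each statement can use variables created by earlier ones, which is why the order of the statements matters.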

 

Primary Features

This plugin runs Python statement(s) with pandas against multiple input data files (dataframes).

https://en.wikipedia.org/wiki/Pandas_(software)

https://pandas.pydata.org/pandas-docs/stable/getting_started/10min.html

 

Prerequisite

This plugin requires Python and Regular Expression skills.




Need help?

For technical help, contact tech@argos-labs.com


You may also search all operations.



Warning: Initial download may be slow

Please note that pandas is a large software package that uses numerous Python machine learning sub-modules. The bot will need more than a few minutes to download them before it is ready. This applies only to the first run. From the second run on, the local VENV is used to avoid downloading again, unless a new version of the plugin has been selected to replace the one originally in the bot.




Input, Output, Features, and Parameters.

Required Input

1. Output File: One data file.

            Supported formats are .xlsm, .xls, .csv, .tsv, and .json

2. Input Files: As many data files (dataframes) as you would like to process.

            Supported input formats are .xlsm, .xls, .csv, .tsv, and .json

Optional Input

3. Enter one or more Python statements. A text file that contains a list of statements can also be used as input.

4. When an input file has multiple sheets, you can select which sheet to process.

5. You can designate which row to use as the header (column names) for your processing.

6. You can specify a column to be used as the index of the dataframe.

7. You can specify which column(s) to process or to exclude.

8. You can specify pandas data types for your columns.

9. You can specify which character separates your data (default is comma).

10. You can specify the encoding of the input file (default is UTF-8).

11. You can select to either show or hide the index column in your output file.
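
To make options 9 to 11 more concrete, here is a minimal sketch of the equivalent plain pandas calls. It assumes the separator, encoding, and index options correspond to the `sep` and `encoding` arguments of `pandas.read_csv` and the `index` argument of `DataFrame.to_csv`; the sample data is made up, and the plugin's internal handling is not documented here.

```python
import io
import pandas as pd

# A tab-separated input held in memory as UTF-8 bytes (made-up sample data).
raw = "city\tpopulation_millions\nSeoul\t9\nBusan\t3\n".encode("utf-8")

# Option 9 (separator) and option 10 (encoding) expressed as read_csv arguments.
df = pd.read_csv(io.BytesIO(raw), sep="\t", encoding="utf-8")

# Option 11: show or hide the index column in the output.
with_index = df.to_csv(index=True)
without_index = df.to_csv(index=False)

print(with_index)
print(without_index)
```

With the index shown, each output row starts with its row number; with the index hidden, only the data columns are written.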


How to set parameters


When handling multiple input data files, set the parameters for each file in the same order as the input files are listed.
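
The idea of keeping parameters in the same sequence as the input files can be sketched in plain pandas as two parallel lists. The names `input_files` and `per_file_options` are hypothetical, and CSV data is used here only to keep the example self-contained.

```python
import io
import pandas as pd

# Two in-memory inputs with different layouts (made-up sample data).
first = io.StringIO("id,qty\n1,5\n2,7\n")
second = io.StringIO("Price list\nid;price\n1;2.5\n2;4.0\n")

input_files = [first, second]

# Hypothetical per-file settings, listed in the same order as input_files:
# the first entry applies to the first file, the second to the second file.
per_file_options = [
    {"sep": ",", "header": 0},
    {"sep": ";", "header": 1},  # header is on the second row of this file
]

if len(input_files) != len(per_file_options):
    raise ValueError("Each input file needs its own set of options")

# Pair each file with its options by position.
dfs = [
    pd.read_csv(source, **options)
    for source, options in zip(input_files, per_file_options)
]

for df in dfs:
    print(df)
```

If the two lists were in different orders, the second file would be read with a comma separator and the wrong header row, so the parsed columns would be wrong.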







pandas-III plugin parameters are fully compatible with the pandas read_excel specification

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_excel.html

Each plugin parameter on the left below corresponds to the read_excel parameter on the right. Please refer to the pandas documentation linked above for details.


  • Sheet Name         →sheet_name
  • Header Row         →header
  • Index Col              →index_col
  • Use Col                →usecols
  • Data Type            →dtype
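
The mapping above can be sketched as a small translation from plugin labels to `pandas.read_excel` keyword arguments. The helper `to_read_excel_kwargs`, the function `load_orders`, and the sheet and column names are hypothetical and exist only for this illustration; the keyword names themselves are those of `read_excel`.

```python
import pandas as pd

# Plugin parameter labels mapped to pandas.read_excel keyword names.
LABEL_TO_KEYWORD = {
    "Sheet Name": "sheet_name",
    "Header Row": "header",
    "Index Col": "index_col",
    "Use Col": "usecols",
    "Data Type": "dtype",
}


def to_read_excel_kwargs(settings):
    """Translate {plugin label: value} into read_excel keyword arguments."""
    return {LABEL_TO_KEYWORD[label]: value for label, value in settings.items()}


# Hypothetical settings for one workbook; sheet and column names are made up.
settings = {
    "Sheet Name": "Orders",
    "Header Row": 0,
    "Index Col": 0,
    "Use Col": ["order_id", "customer", "total"],
    "Data Type": {"customer": "string", "total": "float64"},
}
kwargs = to_read_excel_kwargs(settings)
print(kwargs)


def load_orders(path):
    # Reading .xlsx files needs an Excel engine such as openpyxl installed.
    return pd.read_excel(path, **kwargs)
```

Calling `load_orders` with a workbook path would read the "Orders" sheet, use the first row as the header, keep the three listed columns, and use the first of them as the index.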


