Metadata-Version: 2.1
Name: py-html-table
Version: 0.0.2
Summary: Python library to extract data from HTML Tables with rowspan
Home-page: https://github.com/harinathselvaraj/py_html_table
Author: Harinath Selvaraj
Author-email: harinath.selvaraj@outlook.com
License: UNKNOWN
Description: # py-html-table Package
        
        This is a simple package which uses beautifulSoup to extract HTML Table data along which has rowspan.
        
        ## Installation
        
        pip install 'beautifulsoup4==4.5.3'\
        pip install py_html_table\
        
        ## Declare
        
        import py_html_table\
        
        ## Parameters
        
        Parameter | Meaning | Sample Values
        ----------|---------|--------
        table | python variable containing html code of table | any variable name
        col | Total No.of columns in the table. Starts from 1 | 5
        begin | No.of rows to begin scrapping. Starts from 0 | 2
        output | Type of output that you need | list (or) dataframe (or) csv
        
        ## Usage Example
        
        import requests\
        from bs4 import BeautifulSoup\
        import requests_html\
        import lxml.html as lh\
        import py_html_table\
        
        url = 'https://en.wikipedia.org/wiki/List_of_Presidents_of_the_United_States'\
        session = requests_html.HTMLSession()\
        r = session.get(url)\
        content = BeautifulSoup(r.content, 'lxml')\
        all_tables = content.select(".wikitable")\
        
        col = 9          # Total No.of columns in the table. Starts from 1\
        begin = 2        # No.of rows to begin scrapping. Starts from 0\
        output = 'dataframe'  # options - list, dataframe, csv\
        raw = 'N'        # 'Y' -> complete HTML content. 'N' -> only text inside HTML\
        
        py_html_table.extract(table,begin,col,output,raw)
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
