[pandas] web scraping

티스토리 뷰

공부 (@Deprecated)

[pandas] web scraping

승가비 2022. 8. 13. 02:08

728x90

pip install lxml html5lib beautifulsoup4

import pandas as pd

url = 'https://en.wikipedia.org/wiki/History_of_Python'
dfs = pd.read_html(url)

print(len(dfs))

print(dfs[0]['Version'])
print(dfs[0]['Release date'])

# Load pandas

import pandas as pd

# Webpage url                                                                                                               
url = 'https://en.wikipedia.org/wiki/History_of_Python'

# Extract tables
dfs = pd.read_html(url)

# Get first table                                                                                                           
df = dfs[0]

# Extract columns                                                                                                           
df2 = df[['Version','Release date']]
print(df2)

# Write to excel
df2.to_excel('python.xlsx')

https://pythonbasics.org/pandas-web-scraping/

Pandas Web Scraping - Python Tutorial

Pandas makes it easy to scrape a table ( tag) on a web page. After obtaining it as a DataFrame, it is of course possible to do various processing and save it as an Excel file or csv file. In this article you’ll learn how to extract a table from any webpa

pythonbasics.org

728x90

저작자표시 비영리 (새창열림)

'공부 (@Deprecated)' 카테고리의 다른 글

[kotlin] public static void main (0)	2022.08.18
[gradle] jar > archive contains more than 65535 entries. (0)	2022.08.17
[Scala] SparkSession.conf(String, Any) (0)	2022.08.13
[kotlin] map -> flatMap (0)	2022.08.11
Spring Webflux + JDBC(혹은 blocking call) 핸들링 방법 (0)	2022.08.11

250x250

공지사항

최근에 올라온 글

최근에 달린 댓글

Total

Today

Yesterday

링크

TAG more

« 2025/12 »
일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31

글 보관함

배우기를 멈추는 사람은 20세건 80세건 늙은 것이다.

티스토리 뷰

[pandas] web scraping

'공부 (@Deprecated)' 카테고리의 다른 글

티스토리툴바