SqueezeNews

SqueezeNews Package


License
Apache-2.0
Install
pip install SqueezeNews==0.0.1

Documentation

SqueezeNews 📖

🇺🇸 English | 🇰🇷 한국어

In today's modern society, a large amount of news is poured out in various fields such as politics, the economy, and society. There is no time to read all these vast amounts of news accumulated every day. A function that summarizes only the core of the article and shows it at a glance can save time for busy modern people and it will help them easily identify trends.

Dependencies 🌏

Mission Statement 📝

The ‘SqueezeNews’ project develops open-source software for extracting and curating articles. The ‘SqueezeNews’ allows for extracting articles using python web-crawling. It is designed to easily bring up the full text and keywords of news articles. Also, it provides various visual data(e.g. word cloud, graphs) according to the frequency of keywords.

Feature List 📋

  • Brings up the full text of news articles.

  • Summarizes articles and extracting keywords.

  • (planned) Creates a word cloud according to the frequency of keywords.

  • (planned) Search for similar articles.

  • (planned) Analysis of news article tendency according to positive or negative word ratio.

Target Development Language

Python .

How to use ✍️

1. Clone GitHub repository

https://github.com/rmakerck37/SqueezeNews.git

2. Install Dependencies

pip install -r requirements.txt

3. Run code

  • Get urls
import news_url_import

keyword = '금값'
num = 10
dictUrl = news_url_import.news_url_import(keyword, num)
  • Get article texts
import news_text

url = dictUrl[0]['url']
myText = news_text.text_extraction(url)
print(myText)
  • Get keywords
import news_keyword

myKeyword = news_keyword.get_keyword(url,'ko')
print(myKeyword)
  • Summarizing news
import news_summarization

mySummary = news_summarization.summarize(text=myText, max_sents=5)

License 🚩

Copyright © 2021 rmakerck37.

This project is available under the Apache-2.0 License.