We provide a general page parser that can intelligently retrieve article lists from sources. For each article URL, it first attempts to use `gne` for parsing, and if that fails, it will try using `llm`.
This solution allows scanning and extracting information from most general news and portal sources.
**However, we strongly recommend that users develop custom parsers for specific sources tailored to their actual business scenarios for more ideal and efficient scanning.**
We also provide a parser specifically for WeChat public articles (mp.weixin.qq.com).
**If you are willing to contribute your custom source-specific parsers to this repository, we would greatly appreciate it!**
## Custom Source Parser Development Specifications
### Specifications
**Remember It should be an asynchronous function**
1.**The parser should be able to intelligently distinguish between article list pages and article detail pages.**
2.**The parser's input parameters should only include `url` and `logger`:**
-`url` is the complete address of the source (type `str`).
-`logger` is the logging object (please do not configure a separate logger for your custom source parser).