# Test Script Documentation
## Web Content Fetching and Parsing

```bash
python craw4ai_fetching.py -S 'url1,url2...'
```
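If you want to drive the fetch step from another script (for example to loop over URL batches), a plain subprocess call works. This is only a minimal sketch: the URLs below are placeholders, and where `craw4ai_fetching.py` writes its results depends on the script itself.

```python
# Minimal sketch: invoke the fetch script from Python (URLs are placeholders).
import subprocess

urls = ["https://example.com/article-1", "https://example.com/article-2"]
subprocess.run(
    ["python", "craw4ai_fetching.py", "-S", ",".join(urls)],
    check=True,  # raise if the fetch script exits with a non-zero status
)
```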
## HTML Content Parsing

```bash
python pre_process_test.py -F 'json_file_path' -R 'record save path'
```
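The `-F` argument points at a JSON file produced by the fetching step. If you want to peek at that file before parsing it, something like the following is enough; the file name is a placeholder and the structure is whatever the fetch script emitted, so nothing specific is assumed here.

```python
# Minimal sketch: inspect the fetched JSON before running pre_process_test.py.
# "fetched_pages.json" is a placeholder path; the structure depends on the fetch output.
import json

with open("fetched_pages.json", "r", encoding="utf-8") as f:
    data = json.load(f)

# Print the top-level shape so you know what pre_process_test.py will receive.
if isinstance(data, dict):
    print("keys:", list(data.keys()))
elif isinstance(data, list):
    print("items:", len(data))
```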
## Large Model Information Extraction Testing

- To create focus point descriptions for test tasks, refer to [reports/wiseflow_report_v036_bigbrother666/task0/focus_point.json](./reports/wiseflow_report_v036_bigbrother666/task0/focus_point.json)

```bash
python get_info_test.py -D 'sample dir' -I 'include ap'
```

- `-I` whether to test LLM extraction of the author and publish date
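For batch runs, this script can also be launched from Python. The sample directory below is a placeholder, and the focus point definitions are expected to follow the format of the referenced focus_point.json; check the script's argument parsing before adding `-I`.

```python
# Minimal sketch: run the LLM extraction test over a sample directory.
# "sample_dir" is a placeholder; add "-I" if you also want to test
# author/publish-date extraction (see the script's own help for the exact form).
import subprocess

subprocess.run(
    ["python", "get_info_test.py", "-D", "sample_dir"],
    check=True,
)
```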
## Result Submission and Sharing
Wiseflow is an open-source project that aims to build an "information crawling tool for everyone" through collective contributions!

At this stage, submitting test results counts the same as submitting project code: you will be accepted as a contributor and may even be invited to participate in commercial projects!
Test results should be submitted to the `reports` directory. Create a subdirectory for each test, named `{test_content}_{test_date}_{tester}`, for example:

```bash
mkdir -p reports/wiseflow_report_v036_bigbrother666
```
Please submit all test samples together with the program's original output, and create a README.md in the directory recording the test content, test date, tester, models tested, conclusions, summary statistics, and so on.
Finally, edit the reports/README.md file and add your test result directory to the index so that others can find it.
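If you prefer to script this bookkeeping, a small helper along these lines can create the report directory, seed its README.md with the required fields, and append the entry to the index. The directory name and README fields follow the pattern described above; the index line format is an assumption, so adjust it to whatever reports/README.md already uses.

```python
# Minimal sketch: scaffold a report directory and register it in the index.
# The directory name is a placeholder following {test_content}_{test_date}_{tester}.
from pathlib import Path

report_dir = Path("reports/wiseflow_report_v036_bigbrother666")
report_dir.mkdir(parents=True, exist_ok=True)

readme = report_dir / "README.md"
if not readme.exists():
    readme.write_text(
        "# Test Report\n\n"
        "- Test content: \n"
        "- Test date: \n"
        "- Tester: \n"
        "- Models tested: \n"
        "- Conclusions: \n",
        encoding="utf-8",
    )

# Append the directory name to the shared index (line format is an assumption).
index = Path("reports/README.md")
with index.open("a", encoding="utf-8") as f:
    f.write(f"- [{report_dir.name}](./{report_dir.name}/)\n")
```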