refactor(tests): YAML-driven fixtures for preprocessor tests

- cases.yaml: 10 test cases with a declarative schema (op, assertions)
- data/: 7 real files (email_action.html, email_thread.html, email_single.html,
  email_heavy.html, generic_page.html, notes.txt, fallback.txt)
- test_preprocessors.py: parametrized from YAML via test_detect / test_preprocess;
  generic assertion engine (no_html_tags, min_length, compression_ratio,
  metadata_keys, contains, not_contains, content_type)
- requirements.txt: add PyYAML
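
A minimal sketch of how a generic assertion engine of this kind might look. The assertion names come from the list above; everything else is an assumption made for illustration: the check_assertions helper, the result shape (content, metadata, content_type), and the exact semantics of each check, in particular compression_ratio, read here as an upper bound on output length divided by input length. The case is written as a plain dict, which is what yaml.safe_load would return for one cases.yaml entry.

```python
import re

# Hypothetical helper: the real engine in test_preprocessors.py is not shown here.
def check_assertions(raw, result, assertions):
    """Return a list of failure messages; an empty list means the case passed."""
    content = result["content"]
    # Assumed semantics for each assertion name; each check returns True on success.
    checks = {
        "no_html_tags": lambda want: (re.search(r"<[a-zA-Z/][^>]*>", content) is None) == want,
        "min_length": lambda n: len(content) >= n,
        "compression_ratio": lambda r: len(content) <= r * len(raw),
        "metadata_keys": lambda keys: all(k in result["metadata"] for k in keys),
        "contains": lambda subs: all(s in content for s in subs),
        "not_contains": lambda subs: all(s not in content for s in subs),
        "content_type": lambda t: result["content_type"] == t,
    }
    failures = []
    for name, expected in assertions.items():
        if name not in checks:
            failures.append(f"unknown assertion: {name}")
        elif not checks[name](expected):
            failures.append(f"{name} failed (expected {expected!r})")
    return failures


# One case in the shape a cases.yaml entry might load to (illustrative values).
case = {
    "op": "preprocess",
    "assertions": {
        "no_html_tags": True,
        "min_length": 5,
        "compression_ratio": 0.5,
        "metadata_keys": ["source"],
        "contains": ["Hello"],
        "not_contains": ["<b>"],
        "content_type": "html",
    },
}

raw = "<p>Hello <b>world</b></p>"
# Stand-in for the preprocessor output; the real return type is an assumption.
result = {"content": "Hello world", "metadata": {"source": "inline"}, "content_type": "html"}

failures = check_assertions(raw, result, case["assertions"])
```

Returning a list of failures instead of raising on the first one lets a parametrized test report every broken assertion for a case in a single run, for example with assert not failures, failures.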

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Roberto Musso
2026-04-07 10:44:41 +02:00
parent a2d6d689e4
commit bf445ac2ce
10 changed files with 424 additions and 180 deletions


@@ -0,0 +1,3 @@
random text content without any structure
line two with some words
line three and more content here