Owners file #1

Merged
tisnik merged 1 commit into lightspeed-core:main from tisnik:owners-file on May 1, 2025

Conversation

tisnik (Contributor) commented May 1, 2025

No description provided.

tisnik merged commit fd48c77 into lightspeed-core:main May 1, 2025
tisnik referenced this pull request in tisnik/lightspeed-stack Jul 9, 2025
tisnik mentioned this pull request Jul 9, 2025
15 tasks
tisnik added a commit that referenced this pull request Jul 9, 2025
LCORE-316: bump-up dependencies #1
are-ces referenced this pull request in are-ces/lightspeed-stack Mar 29, 2026
Fix missing closing brace in LLAMA_STACK_IMAGE
tisnik pushed a commit that referenced this pull request Apr 30, 2026
Apply CodeRabbit's actionable comments and the per-comment nits:

1. PoC results section in spike doc previously listed paths under
   poc-results/ that are deleted before merge per howto-run-a-spike.md
   step 10, leaving broken links in the merged document. Replace the
   file list with a self-contained summary of what the PoC proved
   plus the heading-degradation finding, and a note pointing future
   readers at the PR diff if the raw artifacts are ever needed.

2. Drop the reference to docs/local-stack-testing.md (a local-only
   file, never committed to the repo).

3. Replace fragile line-numbered references (document_processor.py:75,
   :87, byok_guide.md ~106-118) with stable symbol anchors:
   _BaseDB.__init__, _LlamaStackDB.__init__, "Knowledge Sources"
   subsection, "Step 1" subsection. Line numbers rot; section names
   and symbol names rot less.

4. Spec doc now instructs the implementation ticket to extract the
   ("markdown", "html", "pdf") predicate to a single
   MARKDOWN_COMPATIBLE_DOC_TYPES: Final[tuple[str, ...]] constant in
   document_processor.py and reference it from both call sites,
   instead of duplicating the tuple. JIRA #1 scope updated to match.

5. Add R7: PDFReader.load_data emits a logger.warning when its docling
   output is empty / under a small threshold (a likely indicator of a
   scanned PDF given R5's no-OCR scope). Threshold is a module-level
   Final[int] constant. JIRA #1 scope and JIRA #2 test patterns
   updated to require coverage via caplog. Surfacing the silent-
   degradation case in custom_processor.py logs costs nothing and
   makes the OCR-needed signal visible.
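
Items 4 and 5 above could look roughly like the following. This is a minimal sketch, not the project's code: the tuple contents and the MARKDOWN_COMPATIBLE_DOC_TYPES name come from the commit message, while MIN_EXPECTED_PDF_CHARS, its value of 50, and both helper functions are hypothetical names chosen for illustration. The real PDFReader.load_data and docling are deliberately left out so the block stays self-contained.

```python
import logging
from typing import Final

logger = logging.getLogger(__name__)

# Single source of truth for the doc-type predicate (item 4).
MARKDOWN_COMPATIBLE_DOC_TYPES: Final[tuple[str, ...]] = ("markdown", "html", "pdf")

# Hypothetical name and value: the commit message only says "a small threshold".
MIN_EXPECTED_PDF_CHARS: Final[int] = 50


def is_markdown_compatible(doc_type: str) -> bool:
    """Hypothetical helper: both call sites test membership in the one constant."""
    return doc_type in MARKDOWN_COMPATIBLE_DOC_TYPES


def warn_if_probably_scanned(text: str, source: str) -> bool:
    """Hypothetical helper: warn when extracted text is empty or very short (item 5).

    Returns True when a warning was emitted.
    """
    extracted = len(text.strip())
    if extracted < MIN_EXPECTED_PDF_CHARS:
        logger.warning(
            "PDF %s yielded only %d characters of text; "
            "it may be a scanned document that needs OCR",
            source,
            extracted,
        )
        return True
    return False
```

Because the warning goes through the standard logging module, a pytest test could check for it with the caplog fixture, which is the coverage the commit message asks for.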

Plus the two reviewer nits worth carrying into JIRA #1:

- Use docling's TableFormerMode.ACCURATE enum, not the string literal
  "accurate"; both work via Pydantic coercion but the enum is
  type-checked.
- Mirror HTMLReader's choice on whether to call super().__init__();
  llama-index's BaseReader does not require it but symmetry between
  the two readers is preferred.
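
The first nit can be illustrated with a stand-in enum. The sketch below does not import docling: TableMode and coerce_mode are hypothetical names, and the assumption is only that the real TableFormerMode behaves like a string-valued enum, as the commit message's remark about coercion suggests.

```python
from enum import Enum


class TableMode(str, Enum):
    """Hypothetical stand-in for a string-valued mode enum; not docling's class."""

    FAST = "fast"
    ACCURATE = "accurate"


def coerce_mode(value: "str | TableMode") -> TableMode:
    """Mimic value-based coercion: a valid string maps to the enum member."""
    return TableMode(value)


# Both spellings resolve to the same member at runtime...
same = coerce_mode("accurate") is coerce_mode(TableMode.ACCURATE)

# ...but a misspelled string only fails at runtime, whereas a misspelled
# attribute such as TableMode.ACURATE is flagged by a static type checker.
try:
    coerce_mode("acurate")
    typo_rejected = False
except ValueError:
    typo_rejected = True
```

Passing the enum member moves the failure for a misspelling from runtime to type-check time, which is the practical argument behind the nit.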

The spec doc changelog records this revision and its trigger (the
PR #1598 CodeRabbit review).