-
Notifications
You must be signed in to change notification settings - Fork 10.6k
Pull requests: microsoft/markitdown
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: consecutive partial numbers wrongly merged + ipynb string source loses title
#2113
opened Jun 12, 2026 by
Sahilalgo8
Loading…
Add markitdown-dicom plugin for DICOM and DICONDE metadata extraction
#2112
opened Jun 12, 2026 by
timburman
Loading…
feat(grpc): gRPC server + streaming of PDF/PPTX files
#2109
opened Jun 11, 2026 by
krickert
Loading…
feat: extract embedded images from DOCX to local directory
#2107
opened Jun 11, 2026 by
Craftr-X
Loading…
fix: stream large xlsx files to prevent timeout (fixes #2096)
#2105
opened Jun 11, 2026 by
sbidwaibing
Loading…
fix: detect HTML charset from <meta> tag to fix garbled output on CJK-locale systems
#2104
opened Jun 11, 2026 by
liang-zhi-yi
Loading…
feat: add support for legacy .doc file format
#2100
opened Jun 10, 2026 by
li5435945-ship-it
Loading…
Recover PDF text truncated by inline images using PyMuPDF fallback
#2092
opened Jun 8, 2026 by
Muhtasim-Munif-Fahim
Loading…
fix: catch OSError when exiftool binary is missing (#1960)
#2082
opened Jun 6, 2026 by
doitgo
Loading…
Fix: Update ZipConverter docstring and optimize memory handling for large entries
#2079
opened Jun 5, 2026 by
KhudaBuxMagsi
Loading…
Don't abort PDF conversion on a single malformed page
#2078
opened Jun 5, 2026 by
assinscreedFC
Loading…
add Python 3.14 support and bump youtube-transcript-api for compatibility
#2077
opened Jun 5, 2026 by
shatovilya
Loading…
8 tasks done
fix(#15): Add model client example in docs/readme
#2070
opened Jun 4, 2026 by
ltianyi992
Loading…
4 of 5 tasks
feat: support TSV/other delimiters and fix Markdown table collisions
#2061
opened Jun 3, 2026 by
trippinganymess
Loading…
fix: allow DocumentIntelligenceConverter api_version to default to None (fixes #1904)
#2060
opened Jun 3, 2026 by
li5435945-ship-it
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-10.