Spire.PDF is a professional PDF library applied to creating, writing, editing, handling and reading PDF files without any external dependencies. Get free and professional technical support for Spire.PDF for .NET, Java, Android, C++, Python.

Fri Aug 23, 2024 4:38 pm

Hi all,

Can you suggest a strategy for pulling vertical text, like:

t
x
e
t

s
i

s
i
h
t

But rotated 90 degrees, from a table cell without dropping characters or ignoring spaces? If I try the getText() method in PdfTable I get things like thisistext or thiss tet (missing some letters/characters).

Also, there are checkmarks under these vertical cells. Adobe Acrobat's OCR recognizes them as text, but as far as I can tell Spire.PDF (or anything else, for that matter) doesn't recognize them as anything: the table cells come back empty. Can you suggest a way to identify them? They're green, if that helps.

Thanks!

jsrcastrogiovanni
 
Posts: 16
Joined: Thu Aug 22, 2024 6:02 pm

Mon Aug 26, 2024 8:25 am

Hello,

Thank you for your inquiry. I ran a simple simulation test with the latest version (Spire.PDF for Java 10.8.1) and found that the getText() method recognized vertical text correctly. To help us investigate more accurately and resolve your issue, could you provide the relevant input PDF document? You can upload it here or send it to us via email (support@e-iceblue.com). Thank you in advance.

Sincerely,
Amin
E-iceblue support team

Amin.Gan
 
Posts: 283
Joined: Mon Jul 15, 2024 5:40 am

Thu Sep 05, 2024 5:01 pm

Apologies for the long delay; it took some time to get the dummy report I needed. Please find it attached as a PDF inside a ZIP.

jsrcastrogiovanni
 
Posts: 16
Joined: Thu Aug 22, 2024 6:02 pm

Fri Sep 06, 2024 7:43 am

Hello,

Thank you for your feedback.
Using the PDF document you attached, I was able to reproduce the issue. It has been logged in our bug tracking system as SPIREPDF-7026. Our Dev team will investigate it further, and we will let you know once there is any update.

Sincerely,
Amin
E-iceblue support team

Amin.Gan
 
Posts: 283
Joined: Mon Jul 15, 2024 5:40 am

Fri Sep 06, 2024 2:29 pm

Thank you;
This is a critical issue for us, so any progress you can make would be greatly appreciated.
Best,
Jessica

jsrcastrogiovanni
 
Posts: 16
Joined: Thu Aug 22, 2024 6:02 pm

Fri Sep 06, 2024 2:45 pm

Actually, I just noticed something else that isn't replicated in this dummy report but may be connected: in the "Amount Provided" table cell under "Chain of Custody", getText() returns only the text before the first space for any value that contains spaces. Unfortunately I cannot create another demo report, but since this resembles the missing characters in the vertical text, I'd be willing to bet it's related.

jsrcastrogiovanni
 
Posts: 16
Joined: Thu Aug 22, 2024 6:02 pm

Mon Sep 09, 2024 8:13 am

Hello,

Thank you for your feedback.
I added an extra space to your previous document and tested it with the latest version (Spire.PDF for Java 10.9.0), but the result was still normal. I have attached the modified PDF file and a screenshot of my test results for your reference. If you are not using the latest version, please upgrade first. If the issue occurs only in your other documents, we will need one of those documents to reproduce the problem. Thank you in advance.
In addition, regarding SPIREPDF-7026, our Dev team is investigating it now. Once there is any significant progress, we will inform you as soon as possible.

Sincerely,
Amin
E-iceblue support team

Amin.Gan
 
Posts: 283
Joined: Mon Jul 15, 2024 5:40 am

Fri Sep 27, 2024 3:50 pm

Just checking in, since it's been about 3 weeks since we last spoke on this issue. Any updates on this?

jsrcastrogiovanni
 
Posts: 16
Joined: Thu Aug 22, 2024 6:02 pm

Sun Sep 29, 2024 2:06 am

Hello,

Thank you for following up.
Our Dev team has investigated, but because of the complex layout calculations involved, there is unfortunately no significant progress so far. I have reminded them again to look for a solution as soon as possible, and they will keep looking into it. Once there is any progress, we will let you know promptly.

Sincerely,
Amin
E-iceblue support team

Amin.Gan
 
Posts: 283
Joined: Mon Jul 15, 2024 5:40 am

Mon Sep 30, 2024 4:54 pm

Thank you;
If it helps, our needs have changed slightly: we only need to know whether there are checkmarks in the cells in the visible row directly under the cells with vertical text. However, we still need to extract the vertical text as it appears in the PDF without losing characters.

jsrcastrogiovanni
 
Posts: 16
Joined: Thu Aug 22, 2024 6:02 pm
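
One possible way to approach the checkmark requirement above, sketched under assumptions: since the marks are green and the cells come back empty as text, a color test on a rendered page image may be enough. The Java sketch below assumes the page has already been rendered to a BufferedImage by some renderer and that each cell's pixel rectangle is known. GreenMarkSketch, greenFraction, looksChecked, and the color thresholds are hypothetical names and values for illustration, not part of Spire.PDF.

```java
import java.awt.Rectangle;
import java.awt.image.BufferedImage;

class GreenMarkSketch {

    /** Fraction of pixels inside the cell whose green channel clearly dominates red and blue. */
    static double greenFraction(BufferedImage page, Rectangle cell) {
        // Clip the cell to the image so getRGB never reads out of bounds.
        Rectangle r = cell.intersection(new Rectangle(0, 0, page.getWidth(), page.getHeight()));
        if (r.isEmpty()) {
            return 0.0;
        }
        long green = 0;
        for (int y = r.y; y < r.y + r.height; y++) {
            for (int x = r.x; x < r.x + r.width; x++) {
                int rgb = page.getRGB(x, y);
                int red = (rgb >> 16) & 0xFF;
                int grn = (rgb >> 8) & 0xFF;
                int blu = rgb & 0xFF;
                // Assumed thresholds: tune them against real reports.
                if (grn > 100 && grn > red + 40 && grn > blu + 40) {
                    green++;
                }
            }
        }
        return (double) green / ((long) r.width * r.height);
    }

    /** Treats a cell as checked when enough of its area is green. */
    static boolean looksChecked(BufferedImage page, Rectangle cell, double minFraction) {
        return greenFraction(page, cell) >= minFraction;
    }

    public static void main(String[] args) {
        // Synthetic stand-in for a rendered page: white, with a green block in the left cell.
        BufferedImage page = new BufferedImage(100, 40, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < 40; y++) {
            for (int x = 0; x < 100; x++) {
                page.setRGB(x, y, 0xFFFFFF);
            }
        }
        for (int y = 15; y < 25; y++) {
            for (int x = 20; x < 30; x++) {
                page.setRGB(x, y, 0x00A000);
            }
        }
        System.out.println(looksChecked(page, new Rectangle(0, 0, 50, 40), 0.02));
        System.out.println(looksChecked(page, new Rectangle(50, 0, 50, 40), 0.02));
    }
}
```

The thresholds would need tuning on real documents; anti-aliased edges and colored cell backgrounds can both shift the measured fraction.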

Wed Oct 02, 2024 9:51 am

Hello,

Thanks for your reply.
Our office is closed from October 1st to 7th for China's National Day holiday. We will investigate your issue further and look for a solution when we return to the office after the holiday, and we will notify you as soon as there is any progress.

Sincerely,
William
E-iceblue support team

William.Zhang
 
Posts: 454
Joined: Mon Dec 27, 2021 2:23 am

Tue Oct 29, 2024 3:36 am

Hi all,
As it's been almost a month since the last reply on this, and there's a new version of Spire.PDF out (10.10), is this fixed in that version?
Thanks!

jsrcastrogiovanni
 
Posts: 16
Joined: Thu Aug 22, 2024 6:02 pm

Tue Oct 29, 2024 10:10 am

Hello,

Thanks for following up. Sorry, this issue has not been resolved yet, as it involves complex character position calculations. Our Dev team is actively working on a solution, and I will urge them again. If there are any updates, I will notify you immediately. Thank you for your patience.

Sincerely,
Amin
E-iceblue support team

Amin.Gan
 
Posts: 283
Joined: Mon Jul 15, 2024 5:40 am
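
While SPIREPDF-7026 remains open, one workaround sketch for the character position problem described above is to rebuild the string from individual glyph boxes instead of relying on the cell text. The Java example below assumes glyph positions are available from some extraction step (not shown, and dependent on the library). Glyph, rebuild, layoutBottomToTop, and the 0.3 space ratio are hypothetical. It also assumes that y grows downward and that the rotated text reads bottom to top, as in the example in the first post.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;

class VerticalTextSketch {

    /** Hypothetical glyph box: a character plus its extent along the page's y axis. */
    static final class Glyph {
        final char ch;
        final double top;     // y of the glyph's upper edge (y grows downward)
        final double extent;  // glyph size along the y axis

        Glyph(char ch, double top, double extent) {
            this.ch = ch;
            this.top = top;
            this.extent = extent;
        }
    }

    /** Orders glyphs bottom to top and inserts a space wherever the gap between boxes is large. */
    static String rebuild(List<Glyph> glyphs, double spaceRatio) {
        List<Glyph> sorted = new ArrayList<>(glyphs);
        // Reading order for this rotation: the glyph lowest on the page comes first.
        sorted.sort(Comparator.comparingDouble((Glyph g) -> g.top).reversed());
        StringBuilder sb = new StringBuilder();
        Glyph prev = null;
        for (Glyph g : sorted) {
            if (prev != null) {
                // Distance from the bottom edge of this glyph up from the top edge of the previous one.
                double gap = prev.top - (g.top + g.extent);
                if (gap > spaceRatio * g.extent) {
                    sb.append(' ');
                }
            }
            sb.append(g.ch);
            prev = g;
        }
        return sb.toString();
    }

    /** Test helper: simulates glyph boxes for text stacked bottom to top, with an extra gap per space. */
    static List<Glyph> layoutBottomToTop(String text, double size, double wordGap) {
        List<Glyph> out = new ArrayList<>();
        double y = 200.0;
        for (char c : text.toCharArray()) {
            if (c == ' ') {
                y -= wordGap;      // a space leaves a gap but no glyph
            } else {
                out.add(new Glyph(c, y, size));
                y -= size;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Glyph> glyphs = layoutBottomToTop("this is text", 10.0, 5.0);
        Collections.reverse(glyphs); // simulate an extractor that returns glyphs top to bottom
        System.out.println(rebuild(glyphs, 0.3)); // prints "this is text"
    }
}
```

Deriving spaces from gaps rather than from space characters is a deliberate choice here, since the symptoms in this thread (thisistext, thiss tet) suggest the space and ordering information is what gets lost.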
