BOB: Business Objects Board
Not endorsed by or affiliated with SAP

Register | Login 

Want to sponsor BOB? 
Want to sponsor BOB? (Opens a new window)  

General Notice: Upcoming Events: SAP TechEd: Sep 28.

Reading matrix/table from PDF?


 
Search this topic... | Search DI: Text Analytics... | Search Box
Register or Login to Post    Forum Index -> Data Integrator -> DI: Text Analytics  Previous TopicPrint TopicNext Topic
Author Message
ErikR
Forum Enthusiast
Forum Enthusiast



Joined: 10 Jan 2007

Posts: 1085
Location: Wellington, NZ


flag
PostPosted: Wed Aug 24, 2016 4:00 pm 
Post subject: Reading matrix/table from PDF?

Hi all,


I have the following use case that I would like to ask your help with.


I have a product pricelist, in PDF form, that we need to read in using SAP Data Services 4.2 (SP07).
There are some introduction pages but then the product and prices are all in a matrix / tabular format for the following 20 pages or so.

I can read the PDF into Data Services as unstructured text. If I then use Text Data Processing, I can extract all the products (we have a custom dictionary for this as well).

However, the Entity Extraction process seems to ignore the price data completely?

Is there any way that I can get the Entity Extraction process to pick up the prices as well and associate this information with the product?
(With the product being referenced as the topic related to the prices?)

I've used SAP Data Services successfully to process other unstructured text, such as Tweets and emails, but it seems that it struggles more with "semi-structured" text than actually plain sentences?

Any help would be much appreciated!

_________________
Erik Roelofs
Soltius NZ - SAP Gold Partner
"New Zealand's most trusted SAP provider"

SAP Certified Application Associate - Data Integration with SAP Data Services 4.x
Back to top
ErikR
Forum Enthusiast
Forum Enthusiast



Joined: 10 Jan 2007

Posts: 1085
Location: Wellington, NZ


flag
PostPosted: Mon Dec 19, 2016 7:26 pm 
Post subject: Re: Reading matrix/table from PDF?

No one ever had to read in PDF files with table-structured data? Ever? icon_neutral.gif
_________________
Erik Roelofs
Soltius NZ - SAP Gold Partner
"New Zealand's most trusted SAP provider"

SAP Certified Application Associate - Data Integration with SAP Data Services 4.x
Back to top
Display posts from previous:   
Register or Login to Post    Forum Index -> Data Integrator -> DI: Text Analytics  Previous TopicPrint TopicNext Topic
Page 1 of 1 All times are GMT - 5 Hours
 
Jump to:  

Index | About | FAQ | RAG | Privacy | Search |  Register |  Login 

Get community updates via Twitter:

Not endorsed by or affiliated with SAP
Powered by phpBB © phpBB Group
Generated in 0.0099 seconds using 17 queries. (SQL 0.0030 Parse 0.0002 Other 0.0068)
CCBot/2.0 (http://commoncrawl.org/faq/)
Hosted by ForumTopics.com | Terms of Service
phpBB Customizations by the phpBBDoctor.com
Shameless plug for MomentsOfLight.com Moments of Light Logo