BusinessObjects Board

How to extract date part from a text

Hi,

I have unstructured text such as ‘Booking from 01-Mar-2013 to 31-Mar-2013 for a total of 42 hrs’ and ‘Booking for 18-Mar-2013 for a total of 7.5 hrs’, and I want to extract only the month and year part of the date, like ‘Mar-2013’.

Can anyone let me know whether this is possible using the TA transform, and if so, how?

Thanks


paul.bods (BOB member since 2011-06-29)

Yes, for a scenario like this you can use the Entity Extraction transform, found under Text Data Processing in Data Services 4.0 and later.

Filter Entity Extraction Output – tutorial on how to filter Entity Extraction output using entity types defined by a selected language

http://www.sdn.sap.com/irj/scn/elearn?rid=/library/uuid/30d834a5-dc94-2e10-c0b4-e88a1ff66311

Text Data Processing Language Reference Guide – contains information about the linguistic analysis and extraction features that the Entity Extraction transform provides, as well as a reference section for each language module

http://help.sap.com/businessobject/product_guides/sboDS41/en/sbo411_ds_tdp_lang_ref_en.pdf

Look for the DATE, MONTH, and YEAR entity types.
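
To make the expected result concrete, here is a rough sketch in plain Python of what the output should look like for the two sample strings. It is not the Entity Extraction transform and does not use any Data Services API. It is only a regular expression that assumes every date is written as DD-Mon-YYYY, and the function name extract_month_years is made up for this illustration.

```python
import re

# Assumption: dates always look like 01-Mar-2013 (day, 3-letter month, 4-digit year).
# The two capture groups keep only the month abbreviation and the year.
DATE_PATTERN = re.compile(r"\b\d{1,2}-([A-Za-z]{3})-(\d{4})\b")


def extract_month_years(text):
    """Return the distinct 'Mon-YYYY' values found in text, in order of first appearance."""
    found = []
    for month, year in DATE_PATTERN.findall(text):
        label = f"{month}-{year}"
        if label not in found:  # skip repeats such as the start and end of one range
            found.append(label)
    return found


if __name__ == "__main__":
    samples = [
        "Booking from 01-Mar-2013 to 31-Mar-2013 for a total of 42 hrs",
        "Booking for 18-Mar-2013 for a total of 7.5 hrs",
    ]
    for sample in samples:
        print(extract_month_years(sample))
```

A fixed pattern like this only works while the date format stays the same. The entity types above are the better fit if the text contains dates written in several different ways.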

Cheers


alwaite (BOB member since 2013-03-11)