XSLT transformation via FREME
FREME let you use a wide area of XML and HTML content for its e-Services. This is possible by previously transforming the input with XSLT stylesheets into acceptable FREME input. The workflow can be forged into a FREME pipeline to achieve reusability.
Contents
Available XSLT converter
Using XSLT converter
Any available converter can be used by sending XML or HTML content to the endpoint https://api.freme-project.eu/current/toolbox/xslt-converter/documents/CONVERTER-NAME
:
curl -X POST --header 'Content-Type: text/html' -d '<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>The HTML5 Herald</title>
<meta name="description" content="The HTML5 Herald">
<meta name="author" content="SitePoint">
</head>
<body>
<p>hello world</p>
</body>
</html>' 'https://api.freme-project.eu/current/toolbox/xslt-converter/documents/identity-transformation'
The request above simply transforms any valid html document into valid xml. The result should look like:
<?xml version="1.0" encoding="UTF-8"?>
<html xmlns="http://www.w3.org/1999/xhtml" lang="en">
<head>
<meta charset="utf-8"/>
<title>The HTML5 Herald</title>
<meta name="description" content="The HTML5 Herald"/>
<meta name="author" content="SitePoint"/>
</head>
<body>
<p>hello world</p>
</body>
</html>
NOTE: The endpoint defaults to text/xml
for both, the type of the input and output. If you submit HTML content, you have to set the header Content-Type: text/html
. To get HTML back, set the header Accept: text/html
.
NOTE: Using CURL makes it necessary to set the Content-Type header also for text/xml
.
You can play around with existing converters at the interactive FREME api documentation.
Submitting stylesheet parameters
Using a stylesheet that defines global parameters makes it possible to set them while calling the converter. To do so, just submit the parameter-value-pairs with the request.
Have a look at the stylesheet of the converter xslt-with-param:
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="1.0">
<xsl:param name="myparam">set internally</xsl:param>
<xsl:template match="/">
<xsl:value-of select="$myparam"/>
</xsl:template>
</xsl:stylesheet>
The stylesheet simply outputs the content of the parameter myparam
and discards the input. So when you call:
curl -X POST --header 'Content-Type: text/xml' -d '<?xml version="1.0"?>
<note>
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Dont forget me this weekend!</body>
</note>' 'https://api.freme-project.eu/current/toolbox/xslt-converter/documents/xslt-with-param'
you get this:
<?xml version="1.0" encoding="UTF-8"?>set internally
But adding the parameter myparam
to the request:
curl -X POST --header 'Content-Type: text/xml' -d '<?xml version="1.0"?>
<note>
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Dont forget me this weekend!</body>
</note>' 'https://api.freme-project.eu/current/toolbox/xslt-converter/documents/xslt-with-param?myparam=content'
gives this result:
<?xml version="1.0" encoding="UTF-8"?>content
XSLT converter in a pipeline
You can save your workflow in a FREME pipeline. The following pipeline converts xliff to html, does named entity recognition with html roundtripping and converts back to xliff:
[
{
"method": "POST",
"endpoint": "https://api.freme-project.eu/current/toolbox/xslt-converter/documents/xliff20-to-html",
"headers": {
"content-type": "text/xml",
"accept": "text/xml"
}
},
{
"method": "POST",
"endpoint": "https://api.freme-project.eu/current/e-entity/freme-ner/documents?language=en&dataset=dbpedia&mode=all",
"headers": {
"content-type": "text/html",
"accept": "text/html"
}
},
{
"method": "POST",
"endpoint": "https://api.freme-project.eu/current/toolbox/xslt-converter/documents/html-to-xliff20",
"headers": {
"content-type": "text/xml",
"accept": "text/xml"
}
}
]
To use the pipeline, put your input (which has to be json encoded) in the body element of the first request and send it to the FREME pipeline endpoint https://api.freme-project.eu/currentpipelining/chain
.
For instance, execute the following:
curl -X POST -H "Content-Type: application/json" -d '[
{
"method": "POST",
"endpoint": "https://api.freme-project.eu/current/toolbox/xslt-converter/documents/xliff20-to-html",
"headers": {
"content-type": "text/xml",
"accept": "text/xml"
},
"body": "<xliff version=\"2.0\" xmlns=\"urn:oasis:names:tc:xliff:document:2.0\" srcLang=\"en\" trgLang=\"fr\">\n <file id=\"f1\">\n <unit id=\"u1\">\n <segment>\n <source>We very much welcome you in the city of Prague, a home of XML!<\/source>\n <\/segment>\n <\/unit>\n <\/file>\n<\/xliff>"
},
{
"method": "POST",
"endpoint": "https://api.freme-project.eu/current/e-entity/freme-ner/documents?language=en&dataset=dbpedia&mode=all",
"headers": {
"content-type": "text/html",
"accept": "text/html"
}
},
{
"method": "POST",
"endpoint": "https://api.freme-project.eu/current/toolbox/xslt-converter/documents/html-to-xliff20",
"headers": {
"content-type": "text/xml",
"accept": "text/xml"
}
}
]' "https://api.freme-project.eu/current/pipelining/chain?useI18n=false"
Note: The parameter useI18n=false
is necessary here to suppress e-Internalization roundtripping which otherwise would be activated in this case by default. If roundtripping conditions hold, e.g. the input format is the same as the output format and both are accepted e-Internalization output formats, the input gets converted to NIF (turtle) before it enters the pipeline and the pipeline result gets converted back to the output/input format. Furthermore, all pipeline internal input and output formats are set to NIF (turtle). This would collide with the XSLT converters, which require and produce xml or html input/output.
The request above returns enriched xliff:
<?xml version="1.0" encoding="UTF-8"?>
<xliff xmlns="urn:oasis:names:tc:xliff:document:2.0"
version="2.0"
srcLang="en"
trgLang="fr">
<file id="f1">
<unit id="u1">
<segment>
<source xmlns:itsm="urn:oasis:names:tc:xliff:itsm:2.1">We very much welcome you in the city of <mrk id="d5e18"
type="itsm:generic"
itsm:taIdentRef="http://dbpedia.org/resource/Prague">Prague</mrk>, a home of <mrk id="d5e21"
type="itsm:generic"
itsm:taIdentRef="http://dbpedia.org/resource/XML">XML</mrk>!</source>
</segment>
</unit>
</file>
</xliff>
You can easily save and reuse your pipeline via the interactive api documentation.
Manage XSLT converter
XSLT converters can be managed via the REST API endpoint /toolbox/xslt-converter/manage/{converterName}
. All XSLT converter management requests can be executed easily via the interactive API documentation. XSLT converters are restricted resources, so some requests need authenticated access. See authentication for further information.
NOTE: When using the following examples, don’t forget to replace YOUR_TOKEN
by your authentication token.
Add a XSLT converter
curl -X POST --header "X-Auth-Token: YOUR_TOKEN" --header "Content-Type: text/plain" -d XSLT_STYLESHEET "https://api.freme-project.eu/current/toolbox/xslt-converter/manage?name={converterName}"
The body of this request has to be a valid XSLT stylesheet.
Example:
curl -X POST --header "X-Auth-Token: YOUR_TOKEN" --header "Content-Type: text/plain" -d "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n<xsl:stylesheet xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\"\n xmlns:xs=\"http://www.w3.org/2001/XMLSchema\"\n exclude-result-prefixes=\"xs\"\n version=\"2.0\">\n <xsl:template match=\"node()|@*\">\n <xsl:copy>\n <xsl:apply-templates select=\"node()|@*\"></xsl:apply-templates>\n </xsl:copy>\n </xsl:template>\n</xsl:stylesheet>" "https://api.freme-project.eu/current/toolbox/xslt-converter/manage?name=identity-transform"
Get one XSLT converter
curl -X GET [--header "X-Auth-Token: YOUR_TOKEN"] "https://api.freme-project.eu/current/toolbox/xslt-converter/manage/{converterName}"
Example:
curl -X GET "https://api.freme-project.eu/current/toolbox/xslt-converter/manage/identity-transform"
Get all XSLT converters
This request returns all converters to which the currently authenticated user has read access, see authentication for further information.
curl -X GET [--header "X-Auth-Token: YOUR_TOKEN"] "https://api.freme-project.eu/current/toolbox/xslt-converter/manage"
Example:
curl -X GET "https://api.freme-project.eu/current/toolbox/xslt-converter/manage"
Update a XSLT converter
curl -X PUT --header "X-Auth-Token: YOUR_TOKEN" --header "Content-Type: text/plain" -d NEW_XSLT_STYLESHEET "https://api.freme-project.eu/current/toolbox/xslt-converter/manage/{converterName}[&newOwner=NEW_OWNER_NAME][&visibility=NEW_VISIBILITY]"
Examples:
This changes the converter:
curl -X PUT --header "X-Auth-Token: YOUR_TOKEN" --header "Content-Type: text/plain" -d "<xsl:stylesheet xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\" version=\"1.0\">\n <xsl:param name=\"myparam\">set internally</xsl:param>\n <xsl:template match=\"/\">\n <xsl:value-of select=\"$myparam\"/>\n </xsl:template>\n</xsl:stylesheet>\n" "https://api.freme-project.eu/current/toolbox/xslt-converter/manage/identity-transform"
To change the owner, the visibility and the description (this can also be done separately), you can do this:
curl -X PUT --header "X-Auth-Token: YOUR_TOKEN" --header "Content-Type: text/plain" "https://api.freme-project.eu/current/toolbox/xslt-converter/manage/identity-transform?newOwner=klaus&visibility=private&description=new%20description"
NOTE: The User klaus
has to exist.
NOTE: The two example requests can be merged, it was splitted just for explanation purposes.
Delete a XSLT converter
curl -X DELETE --header "X-Auth-Token: YOUR_TOKEN" "https://api.freme-project.eu/current/toolbox/xslt-converter/manage/{converterName}"
Example:
curl -X DELETE --header "X-Auth-Token: YOUR_TOKEN" "https://api.freme-project.eu/current/toolbox/xslt-converter/manage/identity-transform"