Notice: This Wiki is now read only and edits are no longer possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.
SMILA/Documentation/HowTo/How to write a Pipelet
This page describes how to implement and configure your own pipelet in case you wish to add functionality to SMILA.
Contents
What are pipelets?
Pipelets are not standalone services, but their lifecycle and configuration is managed by the workflow engine. Each occurrence of a pipelet in a workflow uses a different pipelet instance. For more information take a look at Pipelets.
Project template
Before writing your own pipelet we recommend you to take a look at the HelloWorldPipelet.java. This pipelet is an example of a very simple processing pipelet and can be used as a template for your pipelets. You can also get it by importing the bundles from the examples directory of SMILA's repository into your workspace and use them as templates:
- org.eclipse.smila.integration.pipelet: template bundle for pipelet development
You can also download these examples from the release downloads or the nightly build downloads. (SMILA-...-integrator-examples.zip)
Implementation
Follow these instructions to implement a new pipelet in SMILA:
- If needed, create a new plugin project. You can add multiple pipelets to a single project. See Development Guidelines: Bundle creation for details.
- In the MANIFEST.MF, add at least these as "Imported Packages" (of course, you will need more to develop your pipelet, depending on what you want to do):
- org.eclipse.smila.blackboard
- org.eclipse.smila.datamodel
- org.eclipse.smila.processing
- org.eclipse.smila.utils
- org.eclipse.smila.processing.parameters (to use ParameterAccessor)
- org.eclipse.smila.processing.util (to use ResultCollector)
- Create a class that implements the interface org.eclipse.smila.processing.Pipelet and make sure that the class has a public no-argument constructor (or none at all).
- Implement void configure(AnyMap configuration). This method is called prior to process. Here you can read the configuration provided for the pipelet in the pipeline. To share those properties store the whole configuration in a member variable or better check the configuration for validity and completeness and then store the settings in a member variable. You can later access the configuration with the ParameterAccessor (see code sample below). The advantage of using the parameter accessor is that you can also override your configuration with configuration parameters set in the single records (e.g. the task parameters are included in the record by PipelineProcessorWorker, so you could override the pipelets configuration with job parameters as long as you use the ParameterAccessor to access parameters in the process method).
- Implement String[] process(Blackboard blackboard, String[] recordIds). Here you have to place the "business logic" of your pipelet. In most cases the result is the same as the input recordIDs. But it is possible to return another list of IDs as result. Use the ResultCollector.addResult(...) to collect the ids you want to return and ResultCollector.addFailedResult(...) to control the error handling behaviour of your pipelet. When creating the ResultCollector you have to determine if the Collector should drop the ids that caused errors from the result id list it creates (using resultCollector.getResultIds())or if these ids should still be part of the result id set. The ResultCollector will also control exception handling for you when setting failed record ids (depending on the existence and value of the parameter _failOnError, default is false). If _failOnError is set to true the ResultCollector will throw a ProcessingException, if not, the error will just be logged (on how to use the ResultCollector see the code sample below).
- Create a pipelet description json file for your pipelet in the SMILA-INF folder of the providing bundle. The file name is arbitrary but must have ".json" extension and the file content must contain at least the class name of the pipelet ({"class" : "your.class.name.here"}). Then they can be detected by the PipeletTracker service. If you would like to register multiple classes, use one separate description file for each class. Don't forget to add the SMILA-INF folder to the bin.includes entry of the bundle's build.properties file.
- Consider thread-safe-ness. Because pipelets may be accessed by multiple threads, make sure that access to member variables (e.g. the configuration) is read-only. For best practices: use local variables instead of member variables if possible.
Further reading
For more information on the ResultCollector see JavaDoc.
Configuration
If your pipelet requires a configuration:
- Add a <configuration> element to the <extensionActivity> section of your pipelet in the BPEL pipeline.
Integration
Running/Debugging in eclipse IDE
Before running SMILA in eclipse with your new pipelets, you have to activate your bundle containing the pipelets:
- Click on Run configurations...
- Select the OSGi Framework-->SMILA configuration
- In the Bundles page, check the box before org.eclipse.smila.integration.pipelet, leave Start Level and Auto-Start on default
- Click Apply
- After starting SMILA in eclipse, you should find your pipelets via REST API: http://localhost:8080/smila/pipelets
Deploy to SMILA installation
It's easy to export your pipelet bundle to a SMILA installation:
- Select your bundle (e.g. org.eclipse.smila.integration.pipelet)
- Right-click on it and click on Export...
- Select Plug-in Development --> Deployable plug-ins and fragments
- Next
- Select a destination folder
- Click Finish
After that you will find the plugin jar in your destination directory in a plugins folder. Copy the plugin jar to your SMILA installation's plugins folder. After (re-)starting SMILA, you should find your pipelets via REST API: http://localhost:8080/smila/pipelets
Examples
Pipelet Usage
This is a template for MyPipelet.java:
package org.eclipse.smila.mypackage import org.eclipse.smila.blackboard.Blackboard; import org.eclipse.smila.processing.ProcessingException; import org.eclipse.smila.processing.Pipelet; import org.eclipse.smila.datamodel.AnyMap; public class MyPipelet implements Pipelet { public void configure(AnyMap configuration) throws ProcessingException { // read the configuration properties } public String[] process(Blackboard blackboard, String[] recordIds) throws ProcessingException { // process the records and create a result } }
And this is how to register the pipelet class in the bundle: Create a folder SMILA-INF in the bundle and add a file MyPipelet.json to this folder:
{ "class": "org.eclipse.smila.mypackage.MyPipelet", "parameters": [ <put your parameter descriptions here> ], "description": "The textual description of my pipelet. This is optional but good style." }
Now add the folder SMILA-INF to the build.properties (or just check it in the Build view of the MANIFEST.MF file in your IDE.
And finally, this is a sample showing how a pipelet is invoked in the BPEL pipeline using an <extensionActivity>. It also shows how the pipelet is configured using a <configuration> section.
... <extensionActivity> <proc:invokePipelet name="invokeMyPipelet"> <proc:pipelet class="org.eclipse.smila.mypackage.MyPipelet" /> <proc:variables input="request" output="request" /> <proc:configuration> <rec:Val key="aStringParam">some value</rec:Val> <rec:Val key="aDateParam" type="datetime">2008-06-11T16:08:00.000+0200</rec:Val> </proc:configuration> </proc:invokePipelet> </extensionActivity> ...
Piplet configuration usage
The following example shows the usage of multiple values for properties:
... <extensionActivity> <proc:invokePipelet name="addValuesToNonExistingAttribute"> <proc:pipelet class="org.eclipse.smila.processing.pipelets.AddValuesPipelet" /> <proc:variables input="request" output="request"/> <proc:configuration> <rec:Val key="outputAttribute">out</rec:Val> <rec:Seq key="valuesToAdd"> <rec:Val>value1</rec:Val> <rec:Val>value2</rec:Val> </rec:Seq> </proc:configuration> </proc:invokePipelet> </extensionActivity> ...
public class AddValuesPipelet implements Pipelet { /** config property name for attribute name to add values to. */ private static final String PARAM_ATTRIBUTE = "outputAttribute"; /** config property name for the values to add. */ private static final String PARAM_VALUES = "valuesToAdd"; /** the pipelet's configuration. */ private AnyMap _config; /** local logger. */ private final Log _log = LogFactory.getLog(getClass()); /** add Any values to an attribute as described in pipelet config or parameters. */ @Override public String[] process(final Blackboard blackboard, final String[] recordIds) throws ProcessingException { final ParameterAccessor paramAccessor = new ParameterAccessor(blackboard, _config); final ResultCollector resultCollector = new ResultCollector(paramAccessor, _log, ProcessingConstants.DROP_ON_ERROR_DEFAULT); for (final String id : recordIds) { paramAccessor.setCurrentRecord(id); // the attribute to which to add the values. final String outputAttribute = paramAccessor.getParameter(PARAM_ATTRIBUTE, null); // the values to add. final Any values = paramAccessor.getParameterAny(PARAM_VALUES); if (values != null && outputAttribute != null) { try { for (final Any value : values) { blackboard.getMetadata(id).add(outputAttribute, value); } resultCollector.addResult(id); } catch (final Exception ex) { resultCollector.addFailedResult(id, ex); } } } return resultCollector.getResultIds(); } /** {@inheritDoc} */ @Override public void configure(final AnyMap configuration) throws ProcessingException { _config = configuration; } }