SMILA/Documentation/HowTo/How to write a Pipelet

This page describes how to implement and configure your own pipelet in case you wish to add functionality to SMILA.

What are pipelets?

Pipelets are not standalone services, but their lifecycle and configuration is managed by the workflow engine. Each occurrence of a pipelet in a workflow uses a different pipelet instance. For more information take a look at Pipelets.

Project template

Before writing your own pipelet we recommend you to take a look at the HelloWorldPipelet.java. This pipelet is an example of a very simple processing pipelet and can be used as a template for your pipelets. You can also get it by importing the bundles from the examples directory of SMILA's repository into your workspace and use them as templates:

org.eclipse.smila.integration.pipelet: template bundle for pipelet development

You can also download these examples from the release downloads or the nightly build downloads. (SMILA-...-integrator-examples.zip)

Implementation

Follow these instructions to implement a new pipelet in SMILA:

If needed, create a new plugin project. You can add multiple pipelets to a single project. See Development Guidelines: Bundle creation for details.

In the MANIFEST.MF, add at least these as "Imported Packages" (of course, you will need more to develop your pipelet, depending on what you want to do):
- org.eclipse.smila.blackboard
- org.eclipse.smila.datamodel
- org.eclipse.smila.processing
- org.eclipse.smila.utils
- org.eclipse.smila.processing.parameters (to use ParameterAccessor)
- org.eclipse.smila.processing.util (to use ResultCollector)

Create a class that implements the interface org.eclipse.smila.processing.Pipelet and make sure that the class has a public no-argument constructor (or none at all).

Implement void configure(AnyMap configuration). This method is called prior to process. Here you can read the configuration provided for the pipelet in the pipeline. To share those properties store the whole configuration in a member variable or better check the configuration for validity and completeness and then store the settings in a member variable. You can later access the configuration with the ParameterAccessor (see code sample below). The advantage of using the parameter accessor is that you can also override your configuration with configuration parameters set in the single records (e.g. the task parameters are included in the record by PipelineProcessorWorker, so you could override the pipelets configuration with job parameters as long as you use the ParameterAccessor to access parameters in the process method).

Do not access OSGi Services in configure() You should not try to use OSGi services in the configure() method, because during startup of SMILA configure() is called immediately during initialization of the BPEL engine, and it is easily possible that the required service has not been activated at that time, and if configure() throws an exception because it cannot lookup the required service, the pipeline will not be deployed and cannot be executed. It's better to lookup the services later in the process() method, because all services should be available then.

Implement String[] process(Blackboard blackboard, String[] recordIds). Here you have to place the "business logic" of your pipelet. In most cases the result is the same as the input recordIDs. But it is possible to return another list of IDs as result. Use the ResultCollector.addResult(...) to collect the ids you want to return and ResultCollector.addFailedResult(...) to control the error handling behaviour of your pipelet. When creating the ResultCollector you have to determine if the Collector should drop the ids that caused errors from the result id list it creates (using resultCollector.getResultIds())or if these ids should still be part of the result id set. The ResultCollector will also control exception handling for you when setting failed record ids (depending on the existence and value of the parameter _failOnError, default is false). If _failOnError is set to true the ResultCollector will throw a ProcessingException, if not, the error will just be logged (on how to use the ResultCollector see the code sample below).
Create a pipelet description json file for your pipelet in the SMILA-INF folder of the providing bundle. The file name is arbitrary but must have ".json" extension and the file content must contain at least the class name of the pipelet ({"class" : "your.class.name.here"}). Then they can be detected by the PipeletTracker service. If you would like to register multiple classes, use one separate description file for each class. Don't forget to add the SMILA-INF folder to the bin.includes entry of the bundle's build.properties file.

Consider thread-safe-ness. Because pipelets may be accessed by multiple threads, make sure that access to member variables (e.g. the configuration) is read-only. For best practices: use local variables instead of member variables if possible.

Configuration

If your pipelet requires a configuration:

Add a <configuration> element to the <extensionActivity> section of your pipelet in the BPEL pipeline.

Integration

Running/Debugging in eclipse IDE

Before running SMILA in eclipse with your new pipelets, you have to activate your bundle containing the pipelets:

Click on Run configurations...
Select the OSGi Framework-->SMILA configuration
In the Bundles page, check the box before org.eclipse.smila.integration.pipelet, leave Start Level and Auto-Start on default
Click Apply
After starting SMILA in eclipse, you should find your pipelets via REST API: http://localhost:8080/smila/pipelets

Deploy to SMILA installation

It's easy to export your pipelet bundle to a SMILA installation:

Select your bundle (e.g. org.eclipse.smila.integration.pipelet)
Right-click on it and click on Export...
Select Plug-in Development --> Deployable plug-ins and fragments
Next
Select a destination folder
Click Finish

After that you will find the plugin jar in your destination directory in a plugins folder. Copy the plugin jar to your SMILA installation's plugins folder. After (re-)starting SMILA, you should find your pipelets via REST API: http://localhost:8080/smila/pipelets

Examples

Pipelet Usage

This is a template for MyPipelet.java:

package org.eclipse.smila.mypackage

import org.eclipse.smila.blackboard.Blackboard;
import org.eclipse.smila.processing.ProcessingException;
import org.eclipse.smila.processing.Pipelet;
import org.eclipse.smila.datamodel.AnyMap;
 
public class MyPipelet implements Pipelet {
 
  public void configure(AnyMap configuration) throws ProcessingException {
    // read the configuration properties
  }
 
  public String[] process(Blackboard blackboard, String[] recordIds) throws ProcessingException {
    // process the records and create a result
  }
}

And this is how to register the pipelet class in the bundle: Create a folder SMILA-INF in the bundle and add a file MyPipelet.json to this folder:

{
   "class": "org.eclipse.smila.mypackage.MyPipelet",
   "parameters": [
     <put your parameter descriptions here>
   ],
   "description": "The textual description of my pipelet. This is optional but good style."
}

The description as well as the parameters section is optional but gives users a hint on which parameters you pipelet uses and what they mean as well as what your pipelet does. The parameters section follows the same restrictions as the workers' parameter descriptions, see SMILA/Documentation/ParameterDefinition for details.

Now add the folder SMILA-INF to the build.properties (or just check it in the Build view of the MANIFEST.MF file in your IDE.

And finally, this is a sample showing how a pipelet is invoked in the BPEL pipeline using an <extensionActivity>. It also shows how the pipelet is configured using a <configuration> section.

...
<extensionActivity>
    <proc:invokePipelet name="invokeMyPipelet">
        <proc:pipelet class="org.eclipse.smila.mypackage.MyPipelet" />
        <proc:variables input="request" output="request" />
        <proc:configuration>
            <rec:Val key="aStringParam">some value</rec:Val>
            <rec:Val key="aDateParam" type="datetime">2008-06-11T16:08:00.000+0200</rec:Val>
        </proc:configuration>       
    </proc:invokePipelet>
</extensionActivity>
...

You can also configure Pipelines with SMILA's BPEL designer. See BPEL Designer for more information and how to install it.

Piplet configuration usage

The following example shows the usage of multiple values for properties:

...
<extensionActivity>
  <proc:invokePipelet name="addValuesToNonExistingAttribute">
    <proc:pipelet class="org.eclipse.smila.processing.pipelets.AddValuesPipelet" />
    <proc:variables input="request" output="request"/>
    <proc:configuration>
      <rec:Val key="outputAttribute">out</rec:Val>
      <rec:Seq key="valuesToAdd">
        <rec:Val>value1</rec:Val>
        <rec:Val>value2</rec:Val>
      </rec:Seq>
    </proc:configuration>
  </proc:invokePipelet>
</extensionActivity>
...

public class AddValuesPipelet implements Pipelet {
 
  /** config property name for attribute name to add values to. */
  private static final String PARAM_ATTRIBUTE = "outputAttribute";
 
  /** config property name for the values to add. */
  private static final String PARAM_VALUES = "valuesToAdd";
 
  /** the pipelet's configuration. */
  private AnyMap _config;
 
  /** local logger. */
  private final Log _log = LogFactory.getLog(getClass());
 
  /** add Any values to an attribute as described in pipelet config or parameters. */
  @Override
  public String[] process(final Blackboard blackboard, final String[] recordIds) throws ProcessingException {
    final ParameterAccessor paramAccessor = new ParameterAccessor(blackboard, _config);
    final ResultCollector resultCollector =
      new ResultCollector(paramAccessor, _log, ProcessingConstants.DROP_ON_ERROR_DEFAULT);
    for (final String id : recordIds) {
      paramAccessor.setCurrentRecord(id);
      // the attribute to which to add the values.
      final String outputAttribute = paramAccessor.getParameter(PARAM_ATTRIBUTE, null);
      // the values to add.
      final Any values = paramAccessor.getParameterAny(PARAM_VALUES);
      if (values != null && outputAttribute != null) {
        try {
          for (final Any value : values) {
            blackboard.getMetadata(id).add(outputAttribute, value);
          }
          resultCollector.addResult(id);
        } catch (final Exception ex) {
          resultCollector.addFailedResult(id, ex);
        }
      }
    }
    return resultCollector.getResultIds();
  }
 
  /** {@inheritDoc} */
  @Override
  public void configure(final AnyMap configuration) throws ProcessingException {
    _config = configuration;
  }
}

Notice: This Wiki is now read only and edits are no longer possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.

SMILA/Documentation/HowTo/How to write a Pipelet

Contents

What are pipelets?

Project template

Implementation

Further reading

Configuration

Integration

Examples

Pipelet Usage

Piplet configuration usage

Breadcrumbs

Notice: This Wiki is now read only and edits are no longer possible. Please see: https://gitlab.eclipse.org/eclipsefdn/helpdesk/-/wikis/Wiki-shutdown-plan for the plan.

SMILA/Documentation/HowTo/How to write a Pipelet

Contents

What are pipelets?

Project template

Implementation

Further reading

Configuration

Integration

Examples

Pipelet Usage

Piplet configuration usage