WebAnnotator Manual
- What is WebAnnotator?
- Installing WebAnnotator
- Getting started
- Creating a new annotation schema
- Annotating pages
- Saving & Exporting
- Known bugs
- Citing WebAnnotator
What is WebAnnotator?
WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension, allowing annotation of both offline and inline pages. The HTML rendering is fully preserved and all annotations consist in new HTML spans with specific styles. WebAnnotator provides an easy and general-purpose framework and is made available under CeCILL free license (close to GNU GPL — see the license text), so that use and further contributions are made simple.
All parts of an HTML document can be annotated: text, images, videos, tables, menus, etc. The annotations are created by simply selecting a part of the document and clicking on the relevant type and subtypes. The annotated elements are then highlighted in a specific color. Annotation schemas can be defined by the user by creating a simple DTD representing the types and subtypes that must be highlighted. Finally, annotations can be saved (HTML with highlighted parts of documents) or exported (in a machine-readable format).
WebAnnotator will be presented at LREC conference, if you use it, please cite this reference.
Installing WebAnnotator
- The official last version of WebAnnotator is on Mozilla add-ons Web site (click on "Add to Firefox" to install)
- WebAnnotator menu now appears is in the Tools menu.
-
You can now add a toolbar button and get a quick access to
the extension menu. To get this button in your toolbar you must right-click
the toolbar and press customize, then select
WebAnnotator button
and drag it to the toolbar.
Getting started
You can play with the add-on by importing sample files
ne.dtd
or chunking.dtd
and
annotating text as shown in section
"Annotating pages". These two sample files
are also provided in the samples
directory of the extension
directory.
The first time you begin an annotation session, you might not see the bottom panel. It is because it has a height of zero. Just drag the horizontal line at the bottom and resize the panel.
Creating a new annotation schema
Annotation schemas can be specified to WebAnnotator by importing a DTD into the extension.
![](./files/import.png)
For example, the DTDs corresponding to the two sample annotation schemas provided with the extension are:
Chunking | Named Entities |
---|---|
<!-- Three high-level annotations types : NP, VP, PP -->
|
<!-- Four high-level annotations types : person, org, location, date -->
|
After choosing the DTD file, you can specify options, as the name and description of the schema, as well as colors of your annotations. All these options can be set later by clicking on the Options buttons in the WebAnnotation panel (bottom of the page when the extension is activated).
![](./files/options.png)
Annotating pages
For annotating a specific Web page, select Choose Schema in the WebAnnotator button menu. It is only possible to annotate one document at a time, that is why the button is not activated on other Firefox tabs when a session has begun.
WebAnnotator panel
At the beginning of the session, a panel will open at the bottom of the page. This panel will contain the annotations when you create them, as well as a bar for different settings.
![](./files/general.png)
Note that all links in the HTML document are deactivated by default, in order to allow easy selection of text. You can reactivate links by clicking on Activate Links in the right bottom of the page.
Creating new annotations
Once the annotation schema is annotated, creating new annotations is very simple. Select the elements (text, images, etc.) that you want to annotate, and choose their type in the frame that opens.
![](./files/select.png)
The new annotation will then appear in the panel table.
Editing and removing annotations
You can edit or delete a specific annotation by two different ways. The first method is to find your annotation in the panel and to click on one of the two icons at the left. Selecting the annotation in the panel table will make the annotation blink in the HTML document.
![](./files/edit_panel.png)
The second method is the put your mouse over the annotation itself, in the HTML document. The element will start blinking and the two same icons will appear.
![](./files/edit_text.png)
Saving & Exporting
Two output formats are available in WebAnnotator, namely save format and export format.
-
Save format leads to pretty much the same file as if you had
chosen Save as in the regular Firefox File menu,
except that a small clean-up is achieved in the WebAnnotator-specific
tags in the HTML file.
This format allows you to keep the visual information of annotated parts of documents, but only if the extension is activated. Opening the file in any navigator without the extension will result (visually speaking) in the original page without annotations.
With this format, you can also come back to annotating you document later, which is not the case with the export format. That is why choosing a Save file name is mandatory in the dialog box.
However, save format makes difficult the machine reading and automatic use of the document. Indeed, a same annotation can result in several HTMLspan
tags, in order to avoid overlaping of HTML elements. However, allspan
s coming from a single annotation have the sameWA-id
attribute, which make the automatic merging easier. Anyway, the export format is probably a better way to perform automatic treatments on WA-annotated files.
-
Export format replaces HTML
span
tags by two empty elements,WA_Start
andWA_End
, sharing the sameWA-id
and containing all types and subtypes information.
This makes automatic processing easier, but we can no longer visualize or re-annotate exported WebAnnotator files.
In case you want to keep local copies of all linked URIs in the document (images, CSS, scripts, etc.), check the Save linked URIs box in save dialog box.
![](./files/export.png)
Annotating with multiple schemas
You can annotate a Web page with a given schema, save the result, and then annotate with another schema. You can switch between schemas as many times as you want. Choosing to "keep color information" when saving your annotations will result in all annotations being visible at the same time. On the other hand, by uncheking this box you will see only the annotations of the currently activated schema.Known issues
Do not hesitate to contact me if you find any annoying bug in this add-on. Do not hesitate either if you wish to participate to the add-on evolution.- During an annotation session, closing the windows will lead to two exit messages. If you answer differently to both of them, this can lead to a state of the system where the extension is not usable any more. You then need to reboot Firefox to be able to use it again.
- Links inside iframes are not deactivated.
- Annotation by triple-click selection does not highlight the good text span. Problems on over-selection annotation by double-click have been reported but not reproduced.
Citing WebAnnotator
If you use WebAnnotator as an annotation tool for research, please cite:WebAnnotator, an Annotation Tool for Web Pages. Xavier Tannier. Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012, to appear). Istanbul, Turkey, 2012.