<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//TaxonX//DTD Taxonomic Treatment Publishing DTD v0 20100105//EN" "../../nlm/tax-treatment-NS0.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:tp="http://www.plazi.org/taxpub" article-type="research-article" dtd-version="3.0" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">109</journal-id>
      <journal-id journal-id-type="index">urn:lsid:arphahub.com:pub:3dc5f44e-8666-58db-bc76-a455210e8891</journal-id>
      <journal-title-group>
        <journal-title xml:lang="en">JUCS - Journal of Universal Computer Science</journal-title>
        <abbrev-journal-title xml:lang="en">jucs</abbrev-journal-title>
      </journal-title-group>
      <issn pub-type="ppub">0948-695X</issn>
      <issn pub-type="epub">0948-6968</issn>
      <publisher>
        <publisher-name>Journal of Universal Computer Science</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3217/jucs-020-02-0213</article-id>
      <article-id pub-id-type="publisher-id">22953</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Research Article</subject>
        </subj-group>
        <subj-group subj-group-type="scientific_subject">
          <subject>H.3.0 - General</subject>
          <subject>H.4 - INFORMATION SYSTEMS APPLICATIONS</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Combining Psycho-linguistic, Content-based and Chat-based Features to Detect Predation in Chatrooms</article-title>
      </title-group>
      <contrib-group content-type="authors">
        <contrib contrib-type="author" corresp="yes">
          <name name-style="western">
            <surname>Parapar</surname>
            <given-names>Javier</given-names>
          </name>
          <email xlink:type="simple">javierparapar@udc.es</email>
          <xref ref-type="aff" rid="A1">1</xref>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name name-style="western">
            <surname>Losada</surname>
            <given-names>David E.</given-names>
          </name>
          <xref ref-type="aff" rid="A2">2</xref>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name name-style="western">
            <surname>Barreiro</surname>
            <given-names>Álvaro</given-names>
          </name>
          <xref ref-type="aff" rid="A3">3</xref>
        </contrib>
      </contrib-group>
      <aff id="A1">
        <label>1</label>
        <addr-line content-type="verbatim">University of A Coruña, Coruña, Spain</addr-line>
        <institution>University of A Coruña</institution>
        <addr-line content-type="city">Coruña</addr-line>
        <country>Spain</country>
      </aff>
      <aff id="A2">
        <label>2</label>
        <addr-line content-type="verbatim">Universidade de Santiago de Compostela, Santiago de Compostela, Spain</addr-line>
        <institution>Universidade de Santiago de Compostela</institution>
        <addr-line content-type="city">Santiago de Compostela</addr-line>
        <country>Spain</country>
      </aff>
      <aff id="A3">
        <label>3</label>
        <addr-line content-type="verbatim">University of A Coruña, A Coruña, Spain</addr-line>
        <institution>University of A Coruña</institution>
        <addr-line content-type="city">A Coruña</addr-line>
        <country>Spain</country>
      </aff>
      <author-notes>
        <fn fn-type="corresp">
          <p>Corresponding author: Javier Parapar (<email xlink:type="simple">javierparapar@udc.es</email>).</p>
        </fn>
        <fn fn-type="edited-by">
          <p>Academic editor: </p>
        </fn>
      </author-notes>
      <pub-date pub-type="collection">
        <year>2014</year>
      </pub-date>
      <pub-date pub-type="epub">
        <day>01</day>
        <month>02</month>
        <year>2014</year>
      </pub-date>
      <volume>20</volume>
      <issue>2</issue>
      <fpage>213</fpage>
      <lpage>239</lpage>
      <uri content-type="arpha" xlink:href="http://openbiodiv.net/79F303BE-BE4C-5A20-9C19-1401C8A821EA">79F303BE-BE4C-5A20-9C19-1401C8A821EA</uri>
      <uri content-type="zenodo_dep_id" xlink:href="https://zenodo.org/record/5504797">5504797</uri>
      <history>
        <date date-type="received">
          <day>31</day>
          <month>01</month>
          <year>2014</year>
        </date>
        <date date-type="accepted">
          <day>28</day>
          <month>01</month>
          <year>2013</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>Javier Parapar, David E. Losada, Álvaro Barreiro</copyright-statement>
        <license license-type="creative-commons-attribution" xlink:href="" xlink:type="simple">
          <license-p>This article is freely available under the J.UCS Open Content License.</license-p>
        </license>
      </permissions>
      <abstract>
        <label>Abstract</label>
        <p>The Digital Age has brought great benefits for the human race but also some draw-backs. Nowadays, people from opposite corners of the World can communicate online via instant messaging services. Unfortunately, this has introduced new kinds of crime. Sexual predators haveadapted their predatory strategies to these platforms and, usually, the target victims are kids. The authorities cannot manually track all threats because massive amounts of online conversationstake place in a daily basis. Automatic methods for alerting about these crimes need to be designed. This is the main motivation of this paper, where we present a Machine Learning approachto identify suspicious subjects in chat-rooms. We propose novel types of features for representing the chatters and we evaluate different classifiers against the largest benchmark available.This empirical validation shows that our approach is promising for the identification of predatory behaviour. Furthermore, we carefully analyse the characteristics of the learnt classifiers. Thispreliminary analysis is a first step towards profiling the behaviour of the sexual predators when chatting on the Internet.</p>
      </abstract>
    </article-meta>
  </front>
</article>
