<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//TaxonX//DTD Taxonomic Treatment Publishing DTD v0 20100105//EN" "../../nlm/tax-treatment-NS0.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:tp="http://www.plazi.org/taxpub" article-type="research-article" dtd-version="3.0" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">109</journal-id>
      <journal-id journal-id-type="index">urn:lsid:arphahub.com:pub:3dc5f44e-8666-58db-bc76-a455210e8891</journal-id>
      <journal-title-group>
        <journal-title xml:lang="en">JUCS - Journal of Universal Computer Science</journal-title>
        <abbrev-journal-title xml:lang="en">jucs</abbrev-journal-title>
      </journal-title-group>
      <issn pub-type="ppub">0948-695X</issn>
      <issn pub-type="epub">0948-6968</issn>
      <publisher>
        <publisher-name>Journal of Universal Computer Science</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3217/jucs-019-16-2472</article-id>
      <article-id pub-id-type="publisher-id">23931</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Research Article</subject>
        </subj-group>
        <subj-group subj-group-type="scientific_subject">
          <subject>H.3.1 - Content Analysis and Indexing</subject>
          <subject>H.3.2 - Information Storage</subject>
          <subject>H.3.3 - Information Search and Retrieval</subject>
          <subject>H.3.7 - Digital Libraries</subject>
          <subject>H.5.1 - Multimedia Information Systems</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Text Analysis for Monitoring Personal Information Leakage on Twitter</article-title>
      </title-group>
      <contrib-group content-type="authors">
        <contrib contrib-type="author" corresp="yes">
          <name name-style="western">
            <surname>Choi</surname>
            <given-names>Dongjin</given-names>
          </name>
          <email xlink:type="simple">dongjin.choi84@gmail.com</email>
          <xref ref-type="aff" rid="A1">1</xref>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name name-style="western">
            <surname>Kim</surname>
            <given-names>Jeongin</given-names>
          </name>
          <xref ref-type="aff" rid="A1">1</xref>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name name-style="western">
            <surname>Piao</surname>
            <given-names>Xeufeng</given-names>
          </name>
          <xref ref-type="aff" rid="A2">2</xref>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name name-style="western">
            <surname>Kim</surname>
            <given-names>Pankoo</given-names>
          </name>
        </contrib>
      </contrib-group>
      <aff id="A1">
        <label>1</label>
        <addr-line content-type="verbatim">Chosun University, Gwangju, Republic of Korea</addr-line>
        <institution>Chosun University</institution>
        <addr-line content-type="city">Gwangju</addr-line>
        <country>Republic of Korea</country>
      </aff>
      <aff id="A2">
        <label>2</label>
        <addr-line content-type="verbatim">School of Computer Science and Technology, Weihai, China</addr-line>
        <institution>School of Computer Science and Technology</institution>
        <addr-line content-type="city">Weihai</addr-line>
        <country>China</country>
      </aff>
      <aff id="A3">
        <label>3</label>
        <addr-line content-type="verbatim">Chosun University, Gwangju, </addr-line>
        <institution>Chosun University</institution>
        <addr-line content-type="city">Gwangju</addr-line>
      </aff>
      <author-notes>
        <fn fn-type="corresp">
          <p>Corresponding author: Dongjin Choi (<email xlink:type="simple">dongjin.choi84@gmail.com</email>).</p>
        </fn>
        <fn fn-type="edited-by">
          <p>Academic editor: </p>
        </fn>
      </author-notes>
      <pub-date pub-type="collection">
        <year>2013</year>
      </pub-date>
      <pub-date pub-type="epub">
        <day>01</day>
        <month>10</month>
        <year>2013</year>
      </pub-date>
      <volume>19</volume>
      <issue>16</issue>
      <fpage>2472</fpage>
      <lpage>2485</lpage>
      <uri content-type="arpha" xlink:href="http://openbiodiv.net/AFCB741B-733C-588B-8791-C0C93EFC9705">AFCB741B-733C-588B-8791-C0C93EFC9705</uri>
      <uri content-type="zenodo_dep_id" xlink:href="https://zenodo.org/record/5506113">5506113</uri>
      <history>
        <date date-type="received">
          <day>15</day>
          <month>07</month>
          <year>2013</year>
        </date>
        <date date-type="accepted">
          <day>27</day>
          <month>09</month>
          <year>2013</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>Dongjin Choi, Jeongin Kim, Xeufeng Piao, Pankoo Kim</copyright-statement>
        <license license-type="creative-commons-attribution" xlink:href="" xlink:type="simple">
          <license-p>This article is freely available under the J.UCS Open Content License.</license-p>
        </license>
      </permissions>
      <abstract>
        <label>Abstract</label>
        <p>Social networking services (SNSs) such as Twitter and Facebook can be considered as new forms of media. Information spreads much faster through social media than any other forms of traditional news media because people can upload information with no time and location constraints. For this reason, people have embraced SNSs and allowed them to become an integral part of their everyday lives. People express their emotional status to let others know how they feel about certain information or events. However, they are likely not only to share information with others but also to unintentionally expose personal information such as their place of residence, phone number, and date of birth. If such information is provided to users with inappropriate intentions, there may be serious consequences such as online and offline stalking. To prevent information leakages and detect spam, many researchers have monitored e-mail systems and web blogs. This paper considers text messages on Twitter, which is one of the most popular SNSs in the world, to reveal various hidden patterns by using several coefficient approaches. This paper focuses on users who exchange Tweets and examines the types of information that they reciprocate other's Tweets by monitoring samples of 50 million Tweets which were collected by Stanford University in November 2009. We chose an active Twitter user based on "happy birthday" rule and detecting their information related to place to live and personal names by using proposed coefficient method and compared with other coefficient approaches. As a result of this research, we can conclude that the proposed coefficient method is able to detect and recommend the standard English words for non-standard words in few conditions. Eventually, we detected 88,882 (24.287%) more name included Tweets and 14,054 (3.84%) location related Tweets compared by using only standard word matching method.</p>
      </abstract>
    </article-meta>
  </front>
</article>
