<?xml version='1.0' encoding='utf-8'?>
<eprints xmlns='http://eprints.org/ep2/data/2.0'>
  <eprint id='https://researchdata.gla.ac.uk/id/eprint/588'>
    <eprintid>588</eprintid>
    <rev_number>19</rev_number>
    <documents>
      <document id='https://researchdata.gla.ac.uk/id/document/1855'>
        <docid>1855</docid>
        <rev_number>2</rev_number>
        <files>
          <file id='https://researchdata.gla.ac.uk/id/file/10078'>
            <fileid>10078</fileid>
            <datasetid>document</datasetid>
            <objectid>1855</objectid>
            <filename>Dataset_588.tar.gz</filename>
            <mime_type>application/x-gzip</mime_type>
            <hash>2e33b50f30e5833a533d05f1675b1519</hash>
            <hash_type>MD5</hash_type>
            <filesize>1733274348</filesize>
            <mtime>2018-03-14 10:48:13</mtime>
            <url>https://researchdata.gla.ac.uk/588/1/Dataset_588.tar.gz</url>
          </file>
        </files>
        <eprintid>588</eprintid>
        <pos>1</pos>
        <placement>1</placement>
        <mime_type>application/x-gzip</mime_type>
        <format>Mixed</format>
        <language>en</language>
        <security>public</security>
        <license>cc_by_4</license>
        <main>Dataset_588.tar.gz</main>
        <content>full_archive</content>
      </document>
      <document id='https://researchdata.gla.ac.uk/id/document/1998'>
        <docid>1998</docid>
        <rev_number>2</rev_number>
        <files>
          <file id='https://researchdata.gla.ac.uk/id/file/10761'>
            <fileid>10761</fileid>
            <datasetid>document</datasetid>
            <objectid>1998</objectid>
            <filename>README</filename>
            <mime_type>text/plain</mime_type>
            <hash>0d96aa7d718813563063e00361440459</hash>
            <hash_type>MD5</hash_type>
            <filesize>708</filesize>
            <mtime>2018-06-13 13:42:05</mtime>
            <url>https://researchdata.gla.ac.uk/588/2/README</url>
          </file>
        </files>
        <eprintid>588</eprintid>
        <pos>2</pos>
        <placement>2</placement>
        <mime_type>text/plain</mime_type>
        <format>Text</format>
        <language>en</language>
        <security>public</security>
        <license>cc_by_4</license>
        <main>README</main>
        <content>readme</content>
      </document>
    </documents>
    <eprint_status>archive</eprint_status>
    <userid>6975</userid>
    <dir>disk0/00/00/05/88</dir>
    <datestamp>2018-03-12 10:01:39</datestamp>
    <lastmod>2019-01-23 15:44:39</lastmod>
    <status_changed>2018-03-12 10:01:39</status_changed>
    <type>data_collection</type>
    <metadata_visibility>show</metadata_visibility>
    <creators>
      <item>
        <name>
          <family>Katsarou</family>
          <given>Foteini</given>
        </name>
        <enlightenid>41640</enlightenid>
      </item>
    </creators>
    <uniqueid>glaresearchdata:2018-03-12-588</uniqueid>
    <title>Improving the Performance and Scalability of Pattern Subgraph Queries</title>
    <ispublished>pub</ispublished>
    <divisions>
      <item>30200000</item>
    </divisions>
    <note>This work was funded by the University of Glasgow</note>
    <abstract>The data provided include the datasets used the PhD thesis titled &quot;Improving the Performance and Scalability of Pattern Subgraph Queries&quot;. The thesis contains 7 chapters., from which chapters 4,5 and 6 are related to the dataset provided. The rest of the chapters serve as introductory and concluding material to the thesis. All datasets follow 2 different file formats: grapes and igraph format as described in the README file.

Most of the datasets provided are generated with the synthetic generator GraphGen ( http://www.cse.ust.hk/graphgen/). For the the real datasets: they were obtained as follows:
AIDS, PDBS, PCM, PPI were retrieved from authors of Grapes. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3805575/ Unfortunately the link at which they maintained the dataset does not exist anymore.

Human and yeast were retrieved from J. Lee, W.-S. Han, R. Kasperovics, and J.-H. Lee, “An in-depth comparison of sub-graph isomorphism algorithms in graph databases,” PVLDB, vol. 6, no. 2, pp. 133–144, 2012.

Wordnet is obtained form  http://vlado.fmf.uni-lj.si/pub/networks/data/dic/Wordnet/Wordnet.htm.</abstract>
    <date>2018-03-12</date>
    <date_type>published</date_type>
    <publisher>University of Glasgow</publisher>
    <id_number>10.5525/gla.researchdata.588</id_number>
    <data_type>
      <item>Mixed</item>
    </data_type>
    <copyright_holders>
      <item>The Creators</item>
    </copyright_holders>
    <pending>FALSE</pending>
    <language>English</language>
    <collection_date>
      <date_from>2015</date_from>
      <date_to>2017</date_to>
    </collection_date>
    <retention_date>2028-03-28</retention_date>
    <retention_action>R</retention_action>
    <ethics_consent_required>FALSE</ethics_consent_required>
    <request_copy>FALSE</request_copy>
    <repo_link>
      <item>
        <title>Performance and scalability of indexed subgraph query processing methods</title>
        <link>http://eprints.gla.ac.uk/id/eprint/107199</link>
      </item>
      <item>
        <title>Subgraph Querying with Parallel Use of Query Rewritings and Alternative Algorithms</title>
        <link>http://eprints.gla.ac.uk/id/eprint/130142</link>
      </item>
      <item>
        <title>Hybrid Algorithms for Subgraph Pattern Queries in Graph Databases</title>
        <link>http://eprints.gla.ac.uk/id/eprint/151573</link>
      </item>
    </repo_link>
  </eprint>
</eprints>
