Conference paper Open Access

A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization

Apostolidis, Evlampios; Metsai, Alexandros; Adamantidou, Eleni; Mezaris, Vasileios; Patras, Ioannis


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/d27be899-927e-4ebd-8309-015c5bd1a71a/Apostolidis_Summarization.pdf"
      }, 
      "checksum": "md5:756d6784b1ce31dbae500bd0d794d2cf", 
      "bucket": "d27be899-927e-4ebd-8309-015c5bd1a71a", 
      "key": "Apostolidis_Summarization.pdf", 
      "type": "pdf", 
      "size": 1442696
    }
  ], 
  "owners": [
    22750
  ], 
  "doi": "10.1145/3347449.3357482", 
  "stats": {
    "version_unique_downloads": 57.0, 
    "unique_views": 291.0, 
    "views": 303.0, 
    "version_views": 303.0, 
    "unique_downloads": 57.0, 
    "version_unique_views": 291.0, 
    "volume": 90889848.0, 
    "version_downloads": 63.0, 
    "downloads": 63.0, 
    "version_volume": 90889848.0
  }, 
  "links": {
    "doi": "https://doi.org/10.1145/3347449.3357482", 
    "latest_html": "https://zenodo.org/record/3395967", 
    "bucket": "https://zenodo.org/api/files/d27be899-927e-4ebd-8309-015c5bd1a71a", 
    "badge": "https://zenodo.org/badge/doi/10.1145/3347449.3357482.svg", 
    "html": "https://zenodo.org/record/3395967", 
    "latest": "https://zenodo.org/api/records/3395967"
  }, 
  "created": "2019-09-06T09:04:59.431992+00:00", 
  "updated": "2020-01-20T17:21:07.455803+00:00", 
  "conceptrecid": "3395966", 
  "revision": 7, 
  "id": 3395967, 
  "metadata": {
    "access_right_category": "success", 
    "embargo_date": "2019-10-21", 
    "doi": "10.1145/3347449.3357482", 
    "description": "<p>In this paper we present our work on improving the efficiency of adversarial training for unsupervised video summarization. Our starting point is the SUM-GAN model, which creates a representative summary based on the intuition that such a summary should make it possible to reconstruct a video that is indistinguishable from the original one. We build on a publicly available implementation of a variation of this model, that includes a linear compression layer to reduce the number of learned parameters and applies an incremental approach for training the different components of the architecture. After assessing the impact of these changes to the model&rsquo;s performance, we propose a stepwise, label-based learning process to improve the training efficiency of the adversarial part of the model. Before evaluating our model&rsquo;s efficiency, we perform a thorough study with respect to the used evaluation protocols and we examine the possible performance on two benchmarking datasets, namely SumMe and TVSum. Experimental evaluations and comparisons with the state of the art highlight the competitiveness of the proposed method. An ablation study indicates the benefit of each applied change on the model&rsquo;s performance, and points out the advantageous role of the introduced stepwise, label-based training strategy on the learning efficiency of the adversarial part of the architecture.</p>", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "title": "A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization", 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3395966"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "3395967"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "retv-h2020"
      }
    ], 
    "grants": [
      {
        "code": "780656", 
        "links": {
          "self": "https://zenodo.org/api/grants/10.13039/501100000780::780656"
        }, 
        "title": "Enhancing and Re-Purposing TV Content for Trans-Vector Engagement", 
        "acronym": "ReTV", 
        "program": "H2020", 
        "funder": {
          "doi": "10.13039/501100000780", 
          "acronyms": [], 
          "name": "European Commission", 
          "links": {
            "self": "https://zenodo.org/api/funders/10.13039/501100000780"
          }
        }
      }
    ], 
    "keywords": [
      "Video Summarization", 
      "Unsupervised Learning", 
      "Adversarial Training", 
      "Evaluation Protocol", 
      "Datasets"
    ], 
    "publication_date": "2019-10-21", 
    "creators": [
      {
        "affiliation": "CERTH-ITI, Thermi, Greece, and Queen Mary University of London, UK", 
        "name": "Apostolidis, Evlampios"
      }, 
      {
        "affiliation": "CERTH-ITI, Thermi, Greece", 
        "name": "Metsai, Alexandros"
      }, 
      {
        "affiliation": "CERTH-ITI, Thermi, Greece", 
        "name": "Adamantidou, Eleni"
      }, 
      {
        "affiliation": "CERTH-ITI, Thermi, Greece", 
        "name": "Mezaris, Vasileios"
      }, 
      {
        "affiliation": "Queen Mary University of London, UK", 
        "name": "Patras, Ioannis"
      }
    ], 
    "meeting": {
      "acronym": "AI4TV@ACMMM 2019", 
      "dates": "21 October 2019", 
      "place": "Nice, France", 
      "title": "1st Int. Workshop on AI for Smart TV Content Production, Access and Delivery (AI4TV'19) at ACM Multimedia 2019"
    }, 
    "access_right": "open", 
    "resource_type": {
      "subtype": "conferencepaper", 
      "type": "publication", 
      "title": "Conference paper"
    }
  }
}
Views 303
Downloads 63
Data volume 90.9 MB
Unique views 291
Unique downloads 57
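
The figures above can be cross-checked against the JSON export. The following is a minimal Python sketch, not part of the Zenodo page or API: the trimmed `record` dict and the helper names `split_checksum`, `matches_checksum` and `volume_in_mb` are assumptions made for illustration, and only the four values are copied from the export. It splits the `checksum` field into algorithm and digest, checks whether the reported volume equals downloads times file size (an observation about this record, not a documented definition of `volume`), and converts the volume to decimal megabytes.

```python
import hashlib

# Values copied from the JSON export above; this flat dict is a trimmed-down
# illustration, not the full Zenodo record schema.
record = {
    "checksum": "md5:756d6784b1ce31dbae500bd0d794d2cf",
    "size": 1442696,
    "downloads": 63.0,
    "volume": 90889848.0,
}


def split_checksum(value):
    """Split an 'algorithm:hexdigest' string into its two parts."""
    algorithm, _, digest = value.partition(":")
    return algorithm, digest


def matches_checksum(data, value):
    """Return True if the bytes in `data` hash to the digest in `value`."""
    algorithm, digest = split_checksum(value)
    return hashlib.new(algorithm, data).hexdigest() == digest.lower()


def volume_in_mb(volume_bytes):
    """Convert a byte count to decimal megabytes, rounded to one place."""
    return round(volume_bytes / 1_000_000, 1)


algorithm, digest = split_checksum(record["checksum"])
expected_volume = record["downloads"] * record["size"]

print(algorithm, len(digest))
print(expected_volume == record["volume"])
print(volume_in_mb(record["volume"]))
```

To verify a downloaded copy of `Apostolidis_Summarization.pdf`, one could read the file in binary mode and pass its bytes, together with `record["checksum"]`, to `matches_checksum`.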
