SpEnD: Linked Data SPARQL Endpoints Discovery Using Search Engines

dc.contributor.author Yumusak, Semih
dc.contributor.author Dogdu, Erdogan
dc.contributor.author Kodaz, Halife
dc.contributor.author Kamilaris, Andreas
dc.contributor.author Vandenbussche, Pierre-Yves
dc.date.accessioned 2025-05-13T21:54:17Z
dc.date.available 2025-05-13T21:54:17Z
dc.date.issued 2017
dc.description Yumusak, Semih/0000-0002-8878-4991; Vandenbussche, Pierre-Yves/0000-0003-0591-6109; Dogdu, Erdogan/0000-0001-5987-0164; Kodaz, Halife/0000-0001-8602-4262 en_US
dc.description.abstract Linked data endpoints are online query gateways to semantically annotated linked data sources. The SPARQL query language is the standard for querying these data sources. Although a linked data endpoint (i.e. SPARQL endpoint) is a basic Web service, it provides a platform for federated online querying and data linking methods. For linked data consumers, SPARQL endpoint availability and discovery are crucial for live querying and semantic information retrieval. Current studies show that the availability of linked datasets is very low, while the locations of linked data endpoints change frequently. There are linked data repositories that collect and list the available linked data endpoints or resources. It is observed that around half of the endpoints listed in existing repositories are not accessible (temporarily or permanently offline). These endpoint URLs are shared through repository websites, such as Datahub.io; however, they are weakly maintained and revised only by their publishers. In this study, a novel metacrawling method is proposed for discovering and monitoring linked data sources on the Web. We implemented the method in a prototype system, named SPARQL Endpoints Discovery (SpEnD). SpEnD starts with a "search keyword" discovery process for finding relevant keywords for the linked data domain and specifically SPARQL endpoints. Then, the collected search keywords are utilized to find linked data sources via popular search engines (Google, Bing, Yahoo, Yandex). By using this method, most of the currently listed SPARQL endpoints in existing endpoint repositories, as well as a significant number of new SPARQL endpoints, have been discovered. We analyze our findings in comparison to the Datahub collection in detail. en_US
dc.identifier.doi 10.1587/transinf.2016DAP0025
dc.identifier.issn 0916-8532
dc.identifier.issn 1745-1361
dc.identifier.scopus 2-s2.0-85015258171
dc.identifier.uri https://doi.org/10.1587/transinf.2016DAP0025
dc.identifier.uri https://hdl.handle.net/20.500.12416/9996
dc.language.iso en en_US
dc.publisher IEICE-Inst Electronics Information Communication Engineers en_US
dc.relation.ispartof 8th Forum on Data Engineering and Information Management (DEIM) -- MAR, 2016 -- Fukuoka, JAPAN en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Linked Data en_US
dc.subject Semantic Web en_US
dc.subject SPARQL Endpoint en_US
dc.subject Endpoint Discovery en_US
dc.subject Metasearch en_US
dc.subject Knowledge Graph en_US
dc.title SpEnD: Linked Data SPARQL Endpoints Discovery Using Search Engines en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.id Yumusak, Semih/0000-0002-8878-4991
gdc.author.id Vandenbussche, Pierre-Yves/0000-0003-0591-6109
gdc.author.id Dogdu, Erdogan/0000-0001-5987-0164
gdc.author.id Kodaz, Halife/0000-0001-8602-4262
gdc.author.scopusid 56814988500
gdc.author.scopusid 6603501593
gdc.author.scopusid 8945093700
gdc.author.scopusid 36189564000
gdc.author.scopusid 55603733500
gdc.author.wosid Yumusak, Semih/Y-1134-2019
gdc.author.wosid Kamilaris, Andreas/H-8744-2019
gdc.author.wosid Kodaz, Halife/Abg-2951-2020
gdc.author.wosid Vandenbussche, Pierre-Yves/G-1496-2019
gdc.author.wosid Kodaz, Halife/Q-2141-2015
gdc.bip.impulseclass C4
gdc.bip.influenceclass C4
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department Çankaya University en_US
gdc.description.departmenttemp [Yumusak, Semih] KTO Karatay Univ, Comp Engn Dept, Konya, Turkey; [Dogdu, Erdogan] Cankaya Univ, Comp Engn Dept, Ankara, Turkey; [Kodaz, Halife] Selcuk Univ, Comp Engn Dept, Konya, Turkey; [Kamilaris, Andreas] Insight Res Ctr Data Analyt, Istanbul, Turkey; [Vandenbussche, Pierre-Yves] Fujitsu Ireland Ltd, Swords, Ireland en_US
gdc.description.endpage 767 en_US
gdc.description.issue 4 en_US
gdc.description.publicationcategory Conference Item - International - Institutional Faculty Member en_US
gdc.description.scopusquality Q3
gdc.description.startpage 758 en_US
gdc.description.volume E100D en_US
gdc.description.woscitationindex Science Citation Index Expanded - Conference Proceedings Citation Index - Science
gdc.description.wosquality Q4
gdc.identifier.openalex W2500009570
gdc.identifier.wos WOS:000399371100018
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 11.0
gdc.oaire.influence 3.7379104E-9
gdc.oaire.isgreen true
gdc.oaire.keywords FOS: Computer and information sciences
gdc.oaire.keywords semantic Web
gdc.oaire.keywords knowledge graph
gdc.oaire.keywords 600
gdc.oaire.keywords metasearch
gdc.oaire.keywords linked data
gdc.oaire.keywords endpoint discovery
gdc.oaire.keywords SPARQL endpoint
gdc.oaire.keywords Information Retrieval (cs.IR)
gdc.oaire.keywords 004
gdc.oaire.keywords Computer Science - Information Retrieval
gdc.oaire.popularity 5.5738134E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration National
gdc.openalex.fwci 6.71780446
gdc.openalex.normalizedpercentile 0.96
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 16
gdc.plumx.crossrefcites 1
gdc.plumx.mendeley 30
gdc.plumx.scopuscites 21
gdc.scopus.citedcount 21
gdc.virtual.author Doğdu, Erdoğan
gdc.wos.citedcount 16
relation.isAuthorOfPublication 0d453674-7998-4d57-a06c-03e13bb1e314
relation.isAuthorOfPublication.latestForDiscovery 0d453674-7998-4d57-a06c-03e13bb1e314
relation.isOrgUnitOfPublication 12489df3-847d-4936-8339-f3d38607992f
relation.isOrgUnitOfPublication 43797d4e-4177-4b74-bd9b-38623b8aeefa
relation.isOrgUnitOfPublication 0b9123e4-4136-493b-9ffd-be856af2cdb1
relation.isOrgUnitOfPublication.latestForDiscovery 12489df3-847d-4936-8339-f3d38607992f