|
1
|
|
|
2
|
- Introduction to Semantic Web
- Semantic Web Architecture
- XML
- Resource Description Framework (RDF)
- Technologies and Applications
|
|
3
|
- Web was launched in early 1990s
- Exponential growth in mid-1990s
- Search engines took off as tool for finding resources
- Self-publishing increased amount of low quality hits from search
engines
- Spammers attempt to fool search engines
- Difficulties in using the Web to find relevant resources
|
|
4
|
- "The Semantic Web is an extension of the current web in which
information is given well-defined meaning, better enabling computers and
people to work in cooperation." -- Tim Berners-Lee, James Hendler,
Ora Lassila, The Semantic Web, Scientific American, May 2001
- The Semantic Web is the abstract representation of data on the World
Wide Web (based on the RDF standards and other standards to be defined).
(http://www.w3.org/2001/sw/)
|
|
5
|
- Machines talking to machines – semantics need to be unambiguously
declared
- Joined-up data – enabling complex tasks based on information from
various sources
- Wide scope – from, say, home to government to commerce
- Trust – both in data and who is saying it
- This is not going to be easily achieved
|
|
6
|
|
|
7
|
|
|
8
|
- Resources:
- identified by URI's
- untyped
- Links:
- href, src, ...
- limited, non-descriptive
- Humans:
- Characteristics of the documents
is (normally) clear to
those with a grasp of English.
- Machines:
- Very little structured information available for automated inference
and/or delegation of tasks to an agent.
|
|
9
|
- Resources:
- Common naming syntax (URI's)
- Links:
- Common naming syntax (URI's)
- User:
- Machine:
- More processable information is available
- Computers and people:
- Work, learn and exchange knowledge effectively
|
|
10
|
|
|
11
|
- Enabled standard communication of content combined with format
- Loosely defined specific (limited) grammar specified using SGML
(specific language)
- Started by Tim Berners-Lee
- Standardized by World-Wide Web Consortium (W3C)
- Ubiquitous method of presenting and communicating data
- Used throughout DoD on both public internet and SIPRNET and other
classified networks
- Not useful for machine search
|
|
12
|
- Separates content from format (standard syntax)
- Simplified version of SGML (metalanguage) for defining eXtensible tag
sets
- Started by Tim Bray et al based on conversations with Tim Berners-Lee
- Standardized by W3C
- Used to define updated HTML grammar (XHTML)
- Used in Modeling and Simulation community primarily for Data Interchange
Formats (DIFs) that support data sharing
|
|
13
|
|
|
14
|
- XML:
- Should be used to add tags
- Is extensible (DC qualifiers)
- But:
- XML describes the syntax
- Does not provide semantics (what does DC.Creator mean?)
- The meaning may be agreed & understood within DC applications – but
this does not allow for extensibility
- Similar applications may be described using different XML DTDs: e.g. is <Creator> the same as
<le-Créator> or <Доклады>
|
|
15
|
- Synonymy and polysemy
- <PERSON> vs. <INDIVIDUAL>
- is <SPIDER> an arachnid or software?
- Structural differences
- <PERSON><NAME>Lee Lacy</NAME><PERSON> vs.
- <PERSON><NAME> <FNAME>Lee</FNAME> <LNAME>Lacy</LNAME> </NAME></PERSON>
|
|
16
|
- RDF data consists of nodes and attached attribute/value pairs
- Nodes can be any web resources
- Attributes are named properties of nodes
- Values are either atomic (text strings, numbers, etc.) or other
resources or metadata instances
- Supports labeled directed graphs
- XML used as graph serialization syntax for storing and communicating RDF
instances
- Provides basic ontological primitives
- Classes and relations (properties)
- Class (and property) hierarchy
- RDF triples assert facts about resources
|
|
17
|
- Anything can be a resource.
- Resources have URIs.
- Resources have properties.
- Properties have values and types.
- An RDF document makes statements about resources and their properties.
|
|
18
|
- “Jan Hanford created the J.S. Bach homepage.”
- The J.S. Bach homepage is a resource
- It has a URI (http://www.jsbach.org)
- It has a property
- The property has a type of “creator”
- The property has a value of “Jan Hanford”
|
|
19
|
|
|
20
|
|
|
21
|
- Resource Description Framework
- An XML application
- “Not just tags” – RDF makes use of a formal model
- Basis for “The Semantic Web” (SW)
|
|
22
|
|
|
23
|
- AI (symbolic) Knowledge Representation Techniques – Semantic Networks
and Frames
- Metadata
- Entity-Relationship (ER) - RDBMS
- Object-Oriented Modeling
- XML for file format
- URIs for identifiers
- Limited semantics (node-edge-node)
|
|
24
|
- Expressive inadequacy
- Only range/domain constraints (on properties)
- No properties of properties (unique, transitive, inverse etc.)
- No equivalence, disjointness, coverings etc.
- No necessary and sufficient conditions (for class membership)
- Poorly (un) defined semantics
|
|
25
|
- Provides agent-readable descriptions of data, information, and knowledge
- Built on top of XML and RDF
- Envisioned by Tim Berners-Lee and researched by DARPA team and others
- W3C is beginning a Semantic Web initiative
- Used to define ontologies and associated instance data
- Huge potential for Modeling and Simulation community
|
|
26
|
- More meta-information, such as
- Properties of each relation (transitive, inverse, symmetric, etc.)
- Taxonomic hierarchies
- Uniqueness constraints, Cardinality, etc.
- Ontology community exists- OWL, etc.
|
|
27
|
|
|
28
|
- Developed by DARPA team of researchers
- Language based on description logic technology
- Starting point for W3C semantic web initiative (OWL is replacing DAML)
- Adds additional language features for describing ontologies
- Builds on RDF(S)
|
|
29
|
- Based on FBI website information
- Potential applications for “watch list” matching
- Description properties include:
- Place of birth
- FBI caution
- Physical description
- Languages spoken
|
|
30
|
|
|
31
|
|
|
32
|
- Inference rules
- Query Languages
- Evolution rules
- Universal language for monotonic logic
- Some engines exist (SQL to KIF, Cycl, etc)
|
|
33
|
- All statements on the Web occur in some context.
- Applications need this context in order to evaluate the trustworthiness
of the statements.
- The machinery of the Semantic Web does not assert that all statements
found on the Web are "true".
|
|
34
|
- Ontologies
- Catalogs on the Web
- Electronic commerce transactions
- Operating across many applications (e.g., PDAs)
- Intelligent Agents
- ?
|
|
35
|
- Resources
- http://www.eece.memphis.edu/ksl/ks1/hail001.xml
- http://www.eece.memphis.edu/ksl/ks1/sebring9210.xml
- http://www.eece.memphis.edu/ksl/ks1/engine001.xml
- http://www.eece.memphis.edu/ksl/ks1/sebring9210damage01.xml
- http://www.eece.memphis.edu/ksl/ks1/lili.xml
- Ontology
- http://www.eece.memphis.edu/ksl/uofm_eece.xml
|
|
36
|
- The first version of the Web lacked a metadata framework which was
needed to describe resources
- W3C developed RDF to provide this framework
- The Semantic Web will be based on registries of machine-understandable
definitions
- The Semantic Web will be difficult to achieve
- Applications are needed to show the utility of the Semantic Web
infrastructure currently being developed.
|