Skip to topic | Skip to bottom
Home
Main
Main.InformationTopicsr1.8 - 27 Sep 2007 - 16:11 - Main.novatopic end

Start of topic | Skip to actions

Files and Database topics

File processing
  • File structures
    • Record
      • Fixed
      • Variable length
      • Field
        • Type
  • File access
    • Sequential
    • Direct
  • File organization
    • Sequential
    • Hashed
      • Hash cluster
    • Indexed
      • Indexing techniques
        • B-tree based indexing
          • Dynamic, multilevel indexes
        • Hash-based indexing
      • Indexing challenges
        • Files with dense index
        • Files with variable length records
    • Performance issues
    • Storage requirements for different atomic data types
  • I/O operations
  • Physical and logical files
  • Buffer management

Database systems

  • Database vs. flat files
  • Data independence
  • Components of database systems
    • Data
    • Data dictionary
    • Application programs
    • Users
    • Database administration
  • Scalability
  • Efficiency
    • Throughput
    • Response time
  • Database architecture
  • Types of databases
    • Relational databases
    • Object-oriented databases
    • Rule-based databases
    • XML databases
  • Database machines
  • Data and information modeling
    • Data model
      • Conceptual data model / Semantic data model
        • Entity-relationship model
          • Entity type
          • Relationship type
          • Attribute type
          • Enhanced entity-relationship model
        • Object-oriented data model
        • Specific modeling languages
          • ER Diagram
          • UML Class diagram
          • IDEF1
        • Patterns and standard models
        • Analysis of data requirements
        • Identification of business rules
        • Information abstraction
      • Logical data model
        • Database schema
        • Hierarchical data model
        • Network data model
        • Relational data model
          • Relational database schema
            • Relation
              • Base relation
              • Virtual relation
              • Relational structure
                • Attribute
                  • Candidate key
                • Domain
                • Constraint
                  • Entity integrity
                    • Primary key
                  • Referential integrity
                    • Foreign key
                  • Domain integrity
                  • Functional dependency
            • Database constraint
          • Content
            • Tuples
          • Relational manipulation operations
            • Relational algebra operations
            • Relational calculus
          • Relational database design
            • Mapping conceptual schema to a relational schema
            • Normalization
              • Normal form
                • 1NF
                • 2NF
                • 3NF
                • BCNF
                • 4NF
                • 5NF
              • Anomaly
              • Multivalued dependency
              • Joint dependency
      • Physical data model
      • Data model for data integration (data warehousing, data marts)
        • Dimensional model
        • Star schema
    • CASE tools in data modeling
  • Database languages
    • Query languages
      • Query processing
      • Query optimization
    • Data definition languages (DDL)
    • Data manipulation languages (DML)
    • SQL
      • SQL as DDL
        • Constraints
        • Integrity enforcement
      • SQL as DML
      • SQL Optimization techniques
    • QBE and 4th-generation environments
      • Reporting languages and tools
    • Persistent programming languages
    • Object Query Language
    • XQuery
    • XPath
    • Stored procedures
    • Triggers
  • Transaction processing
    • Transactions
    • Serializability
      • Locking
      • Timestamps
    • Efficiency
    • Failure and recovery
    • Concurrency control
      • Isolation mechanisms
  • Distributed databases
    • Distributed data storage
      • Data fragmentation
      • Data replication
      • Data allocation
      • Horizontal partitioning
      • Vertical partitioning
    • Distributed query processing
    • Distributed transaction model
      • Two-phase commit
    • Distributed concurrency control
      • Distinguished copy technique
      • Voting method
    • Homogeneous
    • Heterogeneous
      • Data translation
      • Program translation
    • Federated
  • Parallel databases
  • Physical database design
    • Specifying characteristics of fields
      • Selecting data type
      • Ensuring data integrity
      • Handling missing data
    • Database tuning
    • Partitioning
      • Horizontal
      • Vertical
    • Characteristics of physical storage devices
    • Data compression
    • File processing
    • Sarbanes-Oxley compliance ### Does this really belong here??

Decision support

  • On-line analytical processing
  • Data integration
    • Data warehouses
    • Data marts
  • Data mining
    • Uses of data mining
    • Patterns
      • Association rules
      • Clustering
      • Frequent sets
    • Data cleaning
    • Data visualization
    • Effects of data problems on data mining results
      • Noise
      • Redundancy
      • Outliers

Storage and retrieval of unstructured information

  • Content analysis and indexing
    • Classification and categorization
      • Classification techniques
        • Metadata
        • Thesauri
        • Ontologies
      • Bibliographic information, bibliometrics, citations
      • Integration of citation, keyword, classification scheme, and other terms
      • Trees, inverted files, PAT trees, signature files, indexing
      • Morphological analysis, stemming, phrases, stop lists
    • Term frequency distributions, uncertainty, fuzziness, weighting
    • Vector space, probabilistic, logical, and advanced models
    • Summarization and visualization
    • Abstracting methods
    • Dictionaries
  • Information search and information retrieval
    • Effectiveness: precision and recall
    • Clustering
    • Information filtering
    • Relevance feedback
    • Retrieval process
      • Query formulation
    • Search process and strategy
    • Selection process
    • User modeling
      • Information seeking behavior
      • Information need analysis
  • Documents, electronic publishing
  • Routing and (community) filtering
  • Protocols and systems (including Z39.50, OPACs, WWW engines, research systems)

Storage and retrieval of semistructured information

  • Web data
  • Markup language
    • HTML
    • SGML
    • XML
      • tagging
      • document nodes
      • element nodes
      • attribute nodes
      • text nodes
      • document order
      • well-formedness
      • namespace
      • DTD
      • XML Schema
        • Validity
        • Simple types
        • Complex types
        • Anonymous types
        • Key
        • Refkey
  • Query and restructuring language
    • XPath
    • XQuery
    • XSLT

Hypertext and hypermedia

  • Hypertext models (early history, web, Dexter, Amsterdam, HyTime?)
  • Link services, engines, and (distributed) hypertext architectures
  • Nodes, composites, and anchors
  • Dimensions, units, locations, spans
  • Browsing, navigation, views, zooming
  • Automatic link generation
  • Presentation, transformations, synchronization
  • Authoring, reading, and annotation
  • Protocols and systems (including web, HTTP)

Multimedia information and systems (should multimedia information and systems be separated???)

  • Devices, device drivers, control signals and protocols, DSPs
  • Applications, media editors, authoring systems, and authoring
  • Streams/structures, capture/represent/transform, spaces/ domains, compression/coding
  • Content-based analysis, indexing, and retrieval of audio, images, and video
  • Presentation, rendering, synchronization, multi-modal integration/interfaces
  • Real-time delivery, quality of service, audio/video conferencing, video-on-demand

Digital libraries (why not an application area for unstructured information)?

  • Collection
    • Digitization, storage, and interchange
    • Digital objects, composites, and packages
  • Dissemination
  • Metadata, cataloging, author submission
  • Naming, repositories, archives
  • Spaces (conceptual, geographical, 2/3D, VR)
  • Architectures (agents, buses, wrappers/mediators), interoperability
  • Services (searching, linking, browsing, and so forth)
  • Intellectual property rights management, privacy, protection (watermarking)
  • Archiving and preservation, integrity
  • Standards
  • Systems issues
  • User experience

Properties of data

  • Quality
  • Completeness
  • Consistency
  • Accuracy
  • Timeliness

Managing the Database Environment

  • Roles and responsibilities of database administrator function
  • Database administration
    • Transaction management and concurrency
      • Deadlock avoidance
    • Recovery from failures
    • Backup, recovery, and restart
    • Redundancy
    • Replication
    • Logging
    • Optimization
    • Security
      • Encryption
      • Prevention of unauthorized access
      • Protection against malware
    • Privacy
    • Ownership and access control; authorization techniques
  • Data management audits
  • Data management architectures
    • Host-based
    • File server
    • Client-server
    • n-tier

Database Application Interface

  • ODBC
  • JDBC
  • Web services
    • SOAP
    • WSDL
    • UDDI

Special purpose databases

  • Temporal databases
  • Spatial databases and GIS
  • Scientific databases
    • Statistical databases
  • Multimedia databases
    • Image databases
  • Textual databases

Knowledge management

Data storage representations

  • Composite structures
  • Contiguous representations
  • Hash-table representations
  • Linked representations
  • Object representation
  • Primitive data items

  • Information abstraction

to top

You are here: Main > VillanovaWikiPages > OntologyProject > InformationTopics

to top

Copyright © 1999-2009 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback