Files and Database topics
File processing
- File structures
- Record
- Fixed
- Variable length
- Field
- File access
- File organization
- Sequential
- Hashed
- Indexed
- Indexing techniques
- B-tree based indexing
- Dynamic, multilevel indexes
- Hash-based indexing
- Indexing challenges
- Files with dense index
- Files with variable length records
- Performance issues
- Storage requirements for different atomic data types
- I/O operations
- Physical and logical files
- Buffer management
Database systems
- Database vs. flat files
- Data independence
- Components of database systems
- Data
- Data dictionary
- Application programs
- Users
- Database administration
- Scalability
- Efficiency
- Database architecture
- Types of databases
- Relational databases
- Object-oriented databases
- Rule-based databases
- XML databases
- Database machines
- Data and information modeling
- Data model
- Conceptual data model / Semantic data model
- Entity-relationship model
- Entity type
- Relationship type
- Attribute type
- Enhanced entity-relationship model
- Object-oriented data model
- Specific modeling languages
- ER Diagram
- UML Class diagram
- IDEF1
- Patterns and standard models
- Analysis of data requirements
- Identification of business rules
- Information abstraction
- Logical data model
- Database schema
- Hierarchical data model
- Network data model
- Relational data model
- Relational database schema
- Relation
- Base relation
- Virtual relation
- Relational structure
- Attribute
- Domain
- Constraint
- Entity integrity
- Referential integrity
- Domain integrity
- Functional dependency
- Database constraint
- Content
- Relational manipulation operations
- Relational algebra operations
- Relational calculus
- Relational database design
- Mapping conceptual schema to a relational schema
- Normalization
- Normal form
- Anomaly
- Multivalued dependency
- Join dependency
- Physical data model
- Data model for data integration (data warehousing, data marts)
- Dimensional model
- Star schema
- CASE tools in data modeling
- Database languages
- Query languages
- Query processing
- Query optimization
- Data definition languages (DDL)
- Data manipulation languages (DML)
- SQL
- SQL as DDL
- Constraints
- Integrity enforcement
- SQL as DML
- SQL optimization techniques
- QBE and 4th-generation environments
- Reporting languages and tools
- Persistent programming languages
- Object Query Language
- XQuery
- XPath
- Stored procedures
- Triggers
- Transaction processing
- Transactions
- Serializability
- Efficiency
- Failure and recovery
- Concurrency control
- Distributed databases
- Distributed data storage
- Data fragmentation
- Data replication
- Data allocation
- Horizontal partitioning
- Vertical partitioning
- Distributed query processing
- Distributed transaction model
- Distributed concurrency control
- Distinguished copy technique
- Voting method
- Homogeneous
- Heterogeneous
- Data translation
- Program translation
- Federated
- Parallel databases
- Physical database design
- Specifying characteristics of fields
- Selecting data type
- Ensuring data integrity
- Handling missing data
- Database tuning
- Partitioning
- Characteristics of physical storage devices
- Data compression
- File processing
- Sarbanes-Oxley compliance (does this really belong here?)
Decision support
- On-line analytical processing
- Data integration
- Data warehouses
- Data marts
- Data mining
- Uses of data mining
- Patterns
- Association rules
- Clustering
- Frequent sets
- Data cleaning
- Data visualization
- Effects of data problems on data mining results
- Noise
- Redundancy
- Outliers
Storage and retrieval of unstructured information
- Content analysis and indexing
- Classification and categorization
- Classification techniques
- Metadata
- Thesauri
- Ontologies
- Bibliographic information, bibliometrics, citations
- Integration of citation, keyword, classification scheme, and other terms
- Trees, inverted files, PAT trees, signature files, indexing
- Morphological analysis, stemming, phrases, stop lists
- Term frequency distributions, uncertainty, fuzziness, weighting
- Vector space, probabilistic, logical, and advanced models
- Summarization and visualization
- Abstracting methods
- Dictionaries
- Information search and information retrieval
- Effectiveness: precision and recall
- Clustering
- Information filtering
- Relevance feedback
- Retrieval process
- Search process and strategy
- Selection process
- User modeling
- Information seeking behavior
- Information need analysis
- Documents, electronic publishing
- Routing and (community) filtering
- Protocols and systems (including Z39.50, OPACs, WWW engines, research systems)
Storage and retrieval of semistructured information
- Web data
- Markup language
- HTML
- SGML
- XML
- Tagging
- Document nodes
- Element nodes
- Attribute nodes
- Text nodes
- Document order
- Well-formedness
- Namespace
- DTD
- XML Schema
- Validity
- Simple types
- Complex types
- Anonymous types
- Key
- Keyref
- Query and restructuring language
Hypertext and hypermedia
- Hypertext models (early history, web, Dexter, Amsterdam, HyTime?)
- Link services, engines, and (distributed) hypertext architectures
- Nodes, composites, and anchors
- Dimensions, units, locations, spans
- Browsing, navigation, views, zooming
- Automatic link generation
- Presentation, transformations, synchronization
- Authoring, reading, and annotation
- Protocols and systems (including web, HTTP)
Multimedia information and systems (should multimedia information and systems be separated?)
- Devices, device drivers, control signals and protocols, DSPs
- Applications, media editors, authoring systems, and authoring
- Streams/structures, capture/represent/transform, spaces/domains, compression/coding
- Content-based analysis, indexing, and retrieval of audio, images, and video
- Presentation, rendering, synchronization, multi-modal integration/interfaces
- Real-time delivery, quality of service, audio/video conferencing, video-on-demand
Digital libraries (why not an application area for unstructured information?)
- Collection
- Digitization, storage, and interchange
- Digital objects, composites, and packages
- Dissemination
- Metadata, cataloging, author submission
- Naming, repositories, archives
- Spaces (conceptual, geographical, 2/3D, VR)
- Architectures (agents, buses, wrappers/mediators), interoperability
- Services (searching, linking, browsing, and so forth)
- Intellectual property rights management, privacy, protection (watermarking)
- Archiving and preservation, integrity
- Standards
- Systems issues
- User experience
Properties of data
- Quality
- Completeness
- Consistency
- Accuracy
- Timeliness
Managing the Database Environment
- Roles and responsibilities of the database administrator function
- Database administration
- Transaction management and concurrency
- Recovery from failures
- Backup, recovery, and restart
- Redundancy
- Replication
- Logging
- Optimization
- Security
- Encryption
- Prevention of unauthorized access
- Protection against malware
- Privacy
- Ownership and access control; authorization techniques
- Data management audits
- Data management architectures
- Host-based
- File server
- Client-server
- n-tier
Database Application Interface
Special purpose databases
- Temporal databases
- Spatial databases and GIS
- Scientific databases
- Multimedia databases
- Textual databases
Knowledge management
Data storage representations
- Composite structures
- Contiguous representations
- Hash-table representations
- Linked representations
- Object representation
- Primitive data items