1. Concepts



Data source

The source of the data is the person or organisation that developed the data. 
  • Data may come from informal sources, such as a conversation, meeting or observations.  
    • It may also be obtained from formal sources, such as a report, book or official document. 
  • A formal source often provides data that is logically organised.  
  • However, this is no guarantee to its accuracy. 
  • The source of the data is protected by the Copyright Act. 
  • People are not allowed to copy the work of another person without permission. 
  • It is against the law to violate copyright. 
  • Most data obtained from the Internet is protected by copyright. 
  • Text and images obtained from a Web site should not be used without acknowledgement of and permission from the owner. 
  • However, the Copyright Act does make special provisions for students to use information for research purposes. 
    • It allows students to use a reasonable portion of the original work if the work is correctly cited. 
    • To cite an Internet source, include the following information: 
      • author’s surname and initial or organisation’s name, 
      • title of the complete work or Web page, 
      • URL of the Web page, 
      • date of the document or Web page publication, and 
      • the download date. 

Accuracy of data

Accuracy of data is the extent to which it is free from errors. Data that is collected on individuals is not always accurate.The inaccuracies may be caused by mistakes in gathering or entering the data, by a mismatch of the data and the person, or by out-of-date information. For example, a car recorded as stolen is later recovered and returned to the owner. If the data has not been quickly updated, the legal owner runs the risk of being arrested as a car thief. Opportunities to check and change incorrect data need to be provided. It is often necessary to compare data from a number of different sources to determine which data is accurate.

Data validation is used to check the entry of data:
  • • A range check is used if the data is restricted to a small range of particular values. For example, when the data is entered in the format dd/mm/yyyy, a range check could be used to determine whether the first two digits are in the range of 1 to 31 and whether the second two digits are in the range of 1 to 12. 
  • • A list check is used when the data can be compared to a set of accepted data. For example, when the data entered is an Australian state, then NSW would be accepted but not NSX.
  • • A type check is used to determine whether the data type is correct (see Figure 2.28). For example, when the data is entered for a person’s family name the software will check whether the data is text. It would not accept a ‘date’ for a person’s name.
  • • A check digit is a digit calculated from the digits of a code number and then added to that number as an extra digit. For example, the ISBN number 0 85859 921 319 4 has a check digit of 4, and the check digit is calculated from 0 85859 921 319.

Data Integrity

Data integrity describes the reliability of the data. Reliable data is accurate,
current and relevant. There is no guarantee that data received from the Internet is
reliable. It is easy to publish information on the Internet. Organisations that are
not the creators of the information often provide access to it. Who is responsible
for the accuracy of the information? Is the creator the only person responsible for
the accuracy of the data? Do organisations that provide access to information have
a responsibility to verify its accuracy? Clearly it is necessary to compare data from
a number of sources and to determine which data is accurate.

Privacy

Privacy is the ability of an individual to control personal data. It is a major concern
for those involved with databases and their use. Large organisations, such as banks,
use databases to store information about their customers. This information is
confidential. It should not be used for other purposes or be made available to
anyone outside the organisation. Privacy is also eroded by linking databases. People
leave a trail of electronic data when they use a credit card, visit the doctor, use the
library, access the Internet or subscribe to a magazine. If this data was combined,
it could create a fairly detailed picture of a person.

Access to data

Access to data is the extent data is available to people. Should all information be free and available to all? What kind of access should individuals have to information about themselves? Why can’t people have free access to data about themselves located in business databases? Do people have the right to access and check their credit history or medical data? The Freedom of Information Act is designed to allow individuals to find out what data is being kept by  the government and other public bodies. It states that individuals have the right to access information where it relates to the individual and does not invade another person’s privacy. There is a fee for the release of this information.

Freedom of information The Office of the

https://www.oaic.gov.au/freedom-of-information/

However, knowing what data is being kept does not indicate how it will be used. People can use our preferences, weaknesses and habits to their advantage. That is, data on what we buy, how we pay for it, what we read, what we watch on television and how much we earn is valuable data for those who want to sell us things. 

Ownership and control of this data is an ethical issue. 
Who owns this data? Is it the individual, or is it the organisation that has processed the data? Is the data being combined, sold and exchanged? For example, an application for an extension to a house is sent to the local council for approval. Who owns this data? Does the building inspector have the right to sell this processed data to a bricklayer? On the other hand, might the owner of the house appreciate another quote for laying bricks?

source: IPT HSC Course Text Book - G.K Powers. Published by Heinenmann
Comments